This repository contains code and instructions for deploying a small open-source Large Language Model (LLM) on AWS Lambda using Python and Docker. The model used for demonstration is Microsoft's Phi-2. This project aims to demonstrate how serverless infrastructure can be used for LLM inference, particularly for applications that process sensitive data or handle specialized tasks.
The project deploys the Microsoft Phi-2 model, a 2.7-billion-parameter LLM, on AWS Lambda using Docker. It demonstrates creating an HTTP REST endpoint through Lambda's function URL mechanism that returns LLM outputs along with execution details.
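For orientation, here is a minimal sketch of what such a handler can look like. The handler name, the `generate` stub, and the response fields are illustrative assumptions rather than the repository's exact code; the actual inference call is sketched under the feature list below:

```python
import json
import time

def generate(prompt: str) -> str:
    # Stub; an llama-cpp-python version is sketched after the feature list.
    return f"(model output for: {prompt!r})"

def handler(event, context):
    # Lambda function URLs deliver the HTTP request body as a string
    # under event["body"].
    payload = json.loads(event.get("body") or "{}")
    prompt = payload.get("prompt", "")

    start = time.time()
    text = generate(prompt)

    # Return the model output together with basic execution details.
    return {
        "statusCode": 200,
        "body": json.dumps({
            "output": text,
            "duration_seconds": round(time.time() - start, 3),
        }),
    }
```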
For a step-by-step tutorial, refer to the article: How to deploy an LLM on AWS Lambda?
Key features:
- Uses the Phi-2 model from Microsoft.
- Implements Docker-based AWS Lambda functions.
- Demonstrates the use of the `llama-cpp-python` package for LLM inference (a usage sketch follows this list).
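As a sketch of how `llama-cpp-python` can run inference against a quantized GGUF build of Phi-2; the model path, file name, and generation parameters here are illustrative assumptions:

```python
from llama_cpp import Llama

# Load a quantized GGUF build of Phi-2. The file name is an assumption
# and depends on which quantization you download.
llm = Llama(model_path="/opt/phi-2.Q4_K_M.gguf", n_ctx=2048)

# llama-cpp-python returns an OpenAI-style completion dict.
result = llm(
    "Instruct: Explain what AWS Lambda is.\nOutput:",
    max_tokens=128,
    temperature=0.7,
)
print(result["choices"][0]["text"])
```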
Prerequisites:
- Basic knowledge of programming, Docker, AWS, and Python.
- AWS account with AWS CLI installed and configured.
- Docker installed on your machine.
- A preferred IDE, such as Visual Studio Code.
Clone this repository to get started with deploying your own LLM on AWS Lambda. Follow the instructions in the tutorial to set up your environment, run the containerized LLM locally, and deploy it to AWS Lambda.
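For local testing, the AWS Lambda Python base images include the Runtime Interface Emulator, which exposes the standard invocation path on the container's port 8080. A minimal smoke test might look like the following, assuming you started the container with port 9000 mapped to 8080 (e.g. `docker run -p 9000:8080 <image>`):

```python
import json
import requests

# The Lambda Runtime Interface Emulator exposes this fixed invocation path.
URL = "http://localhost:9000/2015-03-31/functions/function/invocations"

# Simulate a Lambda function URL event: the HTTP body arrives as a
# string under the "body" key.
event = {"body": json.dumps({"prompt": "Explain serverless computing in one sentence."})}

response = requests.post(URL, json=event)
print(response.json())
```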
Stay updated and reach out through the following channels:
- Newsletter: Subscribe here
- Twitter: @horosin_
- LinkedIn: Profile
Feel free to contribute to this repository, raise issues, or suggest improvements. Your feedback and contributions are highly appreciated!