This project provides an API for efficiently deploying Large Language Model (LLM) applications with Flask. The framework is built with modular code that allows easy integration of new open-source models from a range of platforms, including Hugging Face and Replicate.
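As a rough illustration of the kind of server this framework wraps, here is a minimal Flask sketch. The `/generate` route and the `generate_text` helper are hypothetical stand-ins, not the project's actual endpoints; in the real application the placeholder would be replaced by a model call (e.g. a Hugging Face pipeline).

```python
# Minimal sketch of a Flask LLM endpoint. The route name and the
# generate_text helper are illustrative assumptions, not the project's API.
from flask import Flask, jsonify, request

app = Flask(__name__)

def generate_text(prompt: str) -> str:
    # Placeholder for a real model call (e.g. a Hugging Face pipeline
    # or a Replicate client).
    return f"echo: {prompt}"

@app.route("/generate", methods=["POST"])
def generate():
    data = request.get_json(force=True)
    prompt = data.get("prompt", "")
    return jsonify({"completion": generate_text(prompt)})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```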
Follow these steps to deploy the LLM application:
- Install the required dependencies:

  ```sh
  pip install -r requirements.txt
  ```

- Run `run_server.sh` to deploy the models:

  ```sh
  sh run_server.sh
  ```
To stop the application, kill the process listening on the server's port (replace `PORT` with the port the server was started on):

```sh
kill $(lsof -t -i :PORT)
```