This project is a RESTful wrapper around LLM functionality.
If you have an Nvidia GPU, you need Nvidia's container toolkit:
# on Arch
yay -S nvidia-container-toolkit
sudo systemctl restart docker
Go to the root of the project.
# set up the environment manually
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
# to run tests
pytest
# for production
gunicorn --workers 1 --timeout 300 --bind 0.0.0.0:8000 main:app
# when everything is installed, you can use a script
# to start the server.
./run
If new libraries are added, run
pip freeze > requirements.txt
The embeddings endpoint takes a POST request with:
{
"texts": [
"first text",
"second text",
...
]
}
The response contains an array of 384-dimensional embedding vectors, one per input text.
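As a minimal sketch, the service can be called from Python with only the standard library. The `/embeddings` path and host below are assumptions for illustration; check the actual route in main.py.

```python
import json
import urllib.request

# Hypothetical endpoint URL -- adjust host, port, and path to your deployment.
URL = "http://localhost:8000/embeddings"

def build_payload(texts):
    """Encode the request body in the shape the service expects."""
    return json.dumps({"texts": texts}).encode("utf-8")

def embed(texts, url=URL):
    """POST a list of texts and return the parsed list of embedding vectors."""
    req = urllib.request.Request(
        url,
        data=build_payload(texts),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

# Usage (requires a running server):
#   vectors = embed(["first text", "second text"])
#   each element of vectors should be a 384-dimensional list of floats
```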
From the root of the project
docker build -t gnames/llmutil:latest .
Then run:
docker run -d --gpus all -p 8000:8000 gnames/llmutil:latest
Do not use the --gpus all option if you do not have a GPU.
Tests are located in the tests directory. Install pytest and run:
pytest