llm-server

Star

Here are 4 public repositories matching this topic...

onnx / turnkeyml

Star

Local LLM Server with NPU Acceleration

toolchain benchmark ai amd gpu local-server onnx igpu npu llm llm-server onnxruntime-genai

Updated Apr 25, 2025
Python

pikocloud / pikobrain

Star

Function-calling API for LLM from multiple providers

api gemini openai rag function-calling ollama aws-bedrock llm-server

Updated Aug 10, 2024
Go

fcn94 / llm_stream_endpoint

Star

Simple LLM Rest API using Rust, Warp and Candle. Dedicated for quantized version of either phi-2 ( default) , Mistral, or Llama. Work using CPU or CUDA

api rust streaming cpu rest rest-api cuda cuda-kernels endpoint candle quantized llm llama2 mistral-7b phi-2 llm-server

Updated Apr 2, 2024
Rust

A flexible FastAPI-based framework for handling AI tasks using Large Language Models (LLMs). Supports multiple providers, extensible tasks and routers, Redis caching, and OpenAI integration. Easily scalable for various LLM-based applications.

llm llm-server

Updated Sep 3, 2024
Python

Improve this page

Add a description, image, and links to the llm-server topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-server topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-server

Here are 4 public repositories matching this topic...

onnx / turnkeyml

pikocloud / pikobrain

fcn94 / llm_stream_endpoint

Slyracoon23 / llm_server

Improve this page

Add this topic to your repo