1 |
ollama |
134174 |
11090 |
Go |
1474 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models. |
2025-03-22T03:21:38Z |
2 |
LLaMA-Factory |
44951 |
5498 |
Python |
395 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-03-21T02:56:48Z |
3 |
unsloth |
35447 |
2720 |
Python |
905 |
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 |
2025-03-22T01:02:15Z |
4 |
LocalAI |
31150 |
2356 |
Go |
415 |
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-03-21T21:36:26Z |
5 |
khoj |
26747 |
1469 |
Python |
67 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-03-20T08:37:22Z |
6 |
LibreChat |
23498 |
3922 |
TypeScript |
135 |
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project. |
2025-03-21T23:26:06Z |
7 |
ludwig |
11389 |
1204 |
Python |
38 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-03-03T20:40:07Z |
8 |
OpenLLM |
11005 |
703 |
Python |
0 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-03-18T05:09:35Z |
9 |
mistral-inference |
10118 |
907 |
Jupyter Notebook |
121 |
Official inference library for Mistral models |
2025-03-20T15:03:08Z |
10 |
ipex-llm |
7595 |
1332 |
Python |
1096 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-03-21T07:52:22Z |
11 |
inference |
7147 |
587 |
Python |
170 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-03-21T07:17:53Z |
12 |
ms-swift |
6456 |
555 |
Python |
467 |
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...). |
2025-03-21T10:56:44Z |
13 |
Firefly |
6268 |
566 |
Python |
204 |
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 |
2024-10-24T02:27:42Z |
14 |
big-AGI |
6241 |
1434 |
TypeScript |
230 |
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
2025-03-21T17:06:11Z |
15 |
mistral.rs |
5270 |
380 |
Rust |
106 |
Blazingly fast LLM inference. |
2025-03-22T03:36:39Z |
16 |
enchanted |
5063 |
322 |
Swift |
88 |
Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
17 |
opencompass |
4993 |
527 |
Python |
281 |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. |
2025-03-21T12:09:25Z |
18 |
Liger-Kernel |
4695 |
284 |
Python |
50 |
Efficient Triton Kernels for LLM Training |
2025-03-22T02:10:51Z |
19 |
xtuner |
4405 |
332 |
Python |
213 |
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) |
2025-03-21T09:34:48Z |
20 |
awesome-LLM-resourses |
4388 |
452 |
None |
0 |
🧑🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources. |
2025-03-20T11:24:48Z |
21 |
agentops |
4069 |
363 |
Python |
83 |
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI |
2025-03-22T00:20:15Z |
22 |
chinese-llm-benchmark |
3817 |
166 |
None |
28 |
目前已囊括203个大模型,覆盖chatgpt、gpt-4o、o3-mini、谷歌gemini、Claude3.5、智谱GLM-Zero、文心一言、qwen-max、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及DeepSeek-R1、qwq-32b、deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、gemma3、mistral、书生internLM2.5等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果! |
2025-03-21T08:05:49Z |
23 |
mistral-finetune |
2893 |
260 |
Python |
31 |
None |
2024-09-13T09:53:13Z |
24 |
AI-System-School |
2827 |
321 |
None |
12 |
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials. |
2024-08-14T05:12:47Z |
25 |
paperless-ai |
2737 |
96 |
JavaScript |
8 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-03-21T19:24:53Z |
26 |
xTuring |
2640 |
207 |
Python |
10 |
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6 |
2024-09-23T09:40:48Z |
27 |
lsp-ai |
2610 |
91 |
Rust |
24 |
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them. |
2025-01-07T22:17:38Z |
28 |
secret-llama |
2601 |
164 |
TypeScript |
18 |
Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3. |
2024-06-05T02:04:17Z |
29 |
elia |
2077 |
130 |
Python |
12 |
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. |
2024-10-10T19:12:52Z |
30 |
OnnxStream |
1924 |
89 |
C++ |
55 |
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. |
2025-03-18T05:20:30Z |
31 |
floneum |
1793 |
91 |
Rust |
38 |
Instant, controllable, local pre-trained AI models in Rust |
2025-03-22T00:11:37Z |
32 |
maid |
1793 |
205 |
Dart |
8 |
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. |
2025-03-20T03:09:05Z |
33 |
dialoqbase |
1740 |
273 |
TypeScript |
39 |
Create chatbots with ease |
2024-10-15T14:24:20Z |
34 |
Ollamac |
1739 |
95 |
Swift |
35 |
Mac app for Ollama |
2025-03-12T22:28:22Z |
35 |
json_repair |
1619 |
80 |
Python |
0 |
A python module to repair invalid JSON from LLMs |
2025-03-19T12:21:14Z |
36 |
papersgpt-for-zotero |
1407 |
46 |
JavaScript |
36 |
Zotero chat PDF with AI, DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini |
2025-03-13T04:00:46Z |
37 |
search2ai |
1258 |
192 |
JavaScript |
17 |
Help your LLMs online |
2025-02-19T16:26:01Z |
38 |
modelfusion |
1241 |
89 |
TypeScript |
33 |
The TypeScript library for building AI applications. |
2024-07-19T15:17:19Z |
39 |
aws-genai-llm-chatbot |
1203 |
366 |
TypeScript |
21 |
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS |
2025-02-20T15:20:46Z |
40 |
nextjs-ollama-llm-ui |
1149 |
276 |
TypeScript |
13 |
Fully-featured web interface for Ollama LLMs |
2025-02-04T19:07:06Z |
41 |
gp.nvim |
1095 |
93 |
Lua |
41 |
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..] |
2024-09-23T12:32:50Z |
42 |
bedrock-claude-chat |
1065 |
392 |
TypeScript |
111 |
AWS-native chatbot using Bedrock + Claude (+Nova and Mistral) |
2025-03-21T17:46:28Z |
43 |
poe-api-wrapper |
1061 |
137 |
Python |
27 |
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀 |
2025-03-07T20:07:31Z |
44 |
LLM-Prompt-Library |
1039 |
112 |
Python |
0 |
My personal prompt library for various LLMs + scripts & tools. Suitable for models from Deepseek, OpenAI, Claude, Meta, Mistral, Google, Grok, and others. |
2025-03-18T17:04:23Z |
45 |
chatd |
1016 |
69 |
JavaScript |
26 |
Chat with your documents using local AI |
2024-07-06T01:21:36Z |
46 |
BaseAI |
977 |
82 |
TypeScript |
4 |
BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. |
2025-02-25T11:30:28Z |
47 |
RisuAI |
942 |
163 |
TypeScript |
58 |
Make your own story. User-friendly software for LLM roleplaying |
2025-03-21T04:52:47Z |
48 |
graphrag-local-ollama |
933 |
148 |
Python |
42 |
Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction |
2024-09-30T02:43:30Z |
49 |
ai-dev-gallery |
916 |
111 |
C# |
40 |
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps. |
2025-03-21T22:20:48Z |
50 |
generative-ai-use-cases-jp |
856 |
203 |
TypeScript |
88 |
すぐに業務活用できるビジネスユースケース集付きの安全な生成AIアプリ実装 |
2025-03-21T08:54:43Z |
51 |
witsy |
782 |
56 |
TypeScript |
3 |
Witsy: desktop AI assistant |
2025-03-21T22:04:20Z |
52 |
MixtralKit |
767 |
80 |
Python |
12 |
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI |
2023-12-15T19:10:55Z |
53 |
fine-tune-mistral |
709 |
63 |
Python |
3 |
Fine-tune mistral-7B on 3090s, a100s, h100s |
2023-10-11T17:25:59Z |
54 |
mistral-common |
697 |
78 |
Python |
17 |
None |
2025-03-19T22:27:53Z |
55 |
web-llm-chat |
693 |
114 |
TypeScript |
9 |
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations. |
2025-01-29T19:23:34Z |
56 |
Hexabot |
680 |
120 |
TypeScript |
117 |
Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. |
2025-03-21T14:51:54Z |
57 |
tt-metal |
670 |
119 |
C++ |
2130 |
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model. |
2025-03-22T03:17:39Z |
58 |
ComfyUI-IF_AI_tools |
615 |
47 |
Python |
50 |
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models. |
2025-03-09T09:11:32Z |
59 |
llm-finetuning |
573 |
89 |
Python |
3 |
Guide for fine-tuning Llama/Mistral/CodeLlama models and more |
2024-08-28T10:44:08Z |
60 |
mistral |
569 |
52 |
Python |
18 |
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers. |
2023-11-10T02:55:18Z |
61 |
Owl |
568 |
56 |
Python |
6 |
A personal wearable AI that runs locally |
2024-03-17T06:37:26Z |
62 |
client-python |
566 |
120 |
Python |
13 |
Python client library for Mistral AI platform |
2025-03-21T09:33:25Z |
63 |
parrot.nvim |
544 |
35 |
Lua |
3 |
parrot.nvim 🦜 - the plugin that brings stochastic parrots to Neovim. |
2025-03-18T11:57:54Z |
64 |
BambooAI |
540 |
54 |
Python |
11 |
A Python library powered by Language Models (LLMs) for conversational data discovery and analysis. |
2025-03-02T07:52:21Z |
65 |
ai-commits-intellij-plugin |
516 |
41 |
Kotlin |
23 |
AI Commits for IntelliJ based IDEs/Android Studio. |
2025-03-21T05:09:42Z |
66 |
llmcord |
503 |
98 |
Python |
2 |
Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more) |
2025-03-21T19:37:29Z |
67 |
rag-chatbot |
487 |
74 |
Python |
6 |
Chat with multiple PDFs locally |
2024-10-11T04:30:01Z |
68 |
helix |
472 |
47 |
Go |
124 |
🧬 Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class testing. |
2025-03-21T22:18:54Z |
69 |
embedJs |
469 |
52 |
TypeScript |
25 |
A NodeJS RAG framework to easily work with LLMs and embeddings |
2025-02-14T10:53:44Z |
70 |
ollama-voice-mac |
467 |
54 |
Python |
8 |
Mac compatible Ollama Voice |
2024-03-26T14:49:04Z |
71 |
aikit |
436 |
36 |
Go |
20 |
🏗️ Fine-tune, build, and deploy open-source LLMs easily! |
2025-03-17T03:13:08Z |
72 |
obsidian-bmo-chatbot |
430 |
59 |
TypeScript |
45 |
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian. |
2024-09-12T04:07:29Z |
73 |
mlx-llm |
429 |
31 |
Python |
0 |
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. |
2025-01-29T07:13:07Z |
74 |
LESS |
421 |
40 |
Jupyter Notebook |
15 |
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning |
2024-10-20T03:11:58Z |
75 |
bolna |
413 |
112 |
Python |
28 |
End-to-end platform for building voice first multimodal agents |
2024-10-28T05:40:38Z |
76 |
xllm |
399 |
21 |
Python |
6 |
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
2024-01-17T16:43:39Z |
77 |
DevoxxGenieIDEAPlugin |
394 |
47 |
Java |
37 |
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLM's (Ollama, LMStudio, GPT4All, Jan and Llama.cpp) and Cloud based LLMs to help review, test, explain your project code. |
2025-03-21T18:09:04Z |
78 |
fltr |
380 |
8 |
Rust |
1 |
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
2024-03-13T11:39:01Z |
79 |
GPTPortal |
363 |
65 |
JavaScript |
2 |
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files. |
2025-03-07T19:37:35Z |
80 |
edgen |
356 |
16 |
Rust |
23 |
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs (Llama2, Mistral, Mixtral...), Speech-to-text (whisper) and many others. |
2024-05-23T14:21:38Z |
81 |
NeuralFlow |
344 |
15 |
Python |
4 |
Visualize the intermediate output of Mistral 7B |
2025-01-22T11:25:17Z |
82 |
KVQuant |
336 |
30 |
Python |
14 |
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
2024-08-13T11:19:28Z |
83 |
ai_automation_suggester |
336 |
12 |
Python |
4 |
This custom Home Assistant integration automatically scans your entities, detects new devices, and uses AI (via cloud and local APIs) to suggest tailored automations. It supports multiple AI providers, including OpenAI, Anthropic, Google, Groq, LocalAI, Mistral and Ollama. The integration provides automation suggestions via HASS notifications |
2025-03-09T20:09:18Z |
84 |
LLaMa2lang |
301 |
34 |
Python |
0 |
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language |
2024-06-17T14:00:13Z |
85 |
mistral |
291 |
118 |
Python |
0 |
Workflow Service for OpenStack. Mirror of code maintained at opendev.org. |
2025-03-18T23:37:58Z |
86 |
OllamaKit |
282 |
29 |
Swift |
5 |
Ollama client for Swift |
2025-03-09T22:20:34Z |
87 |
nanodl |
282 |
10 |
Python |
2 |
A Jax-based library for designing and training transformer models from scratch. |
2024-08-28T21:24:22Z |
88 |
airunner |
282 |
23 |
Python |
29 |
Stable Diffusion and LLMs offline on your own hardware |
2025-03-21T16:51:48Z |
89 |
simple-openai |
275 |
30 |
Java |
5 |
A Java library to use the OpenAI Api in the simplest possible way. |
2025-03-20T21:52:03Z |
90 |
yalm |
273 |
27 |
C++ |
1 |
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O |
2025-01-15T07:22:42Z |
91 |
llm-mistral-invoice-cpu |
264 |
63 |
Python |
0 |
Data extraction with LLM on CPU |
2024-03-26T05:44:59Z |
92 |
Heat |
258 |
17 |
Swift |
4 |
An LLM agnostic desktop and mobile client. |
2025-03-21T16:30:16Z |
93 |
unsaged |
255 |
78 |
TypeScript |
15 |
Open source chat kit engineered for seamless interaction with AI models. |
2025-02-25T18:02:25Z |
94 |
aicommit2 |
252 |
20 |
TypeScript |
7 |
A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI |
2025-03-18T02:12:34Z |
95 |
inferflow |
238 |
25 |
C++ |
8 |
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs). |
2024-03-15T06:52:33Z |
96 |
ai-playground |
233 |
52 |
Python |
0 |
Code from tutorials presented on the "Code AI with Rok" YouTube channel |
2025-03-18T17:22:38Z |
97 |
companion-vscode |
231 |
12 |
TypeScript |
3 |
VSCode extension of Quack Companion 💻 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot powered by OSS LLMs (Mistral, Gemma, etc.), served with Ollama. |
2024-10-01T04:06:14Z |
98 |
TPU-Alignment |
230 |
25 |
Jupyter Notebook |
0 |
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free |
2024-10-31T20:34:59Z |
99 |
ProX |
228 |
18 |
Python |
2 |
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" |
2025-02-16T07:59:43Z |
100 |
ollama-ai |
227 |
8 |
Ruby |
0 |
A Ruby gem for interacting with Ollama's API that allows you to run open source AI LLMs (Large Language Models) locally. |
2024-07-21T11:13:36Z |