IterativeLLMRefiner implements a chain of Large Language Models (LLMs) that iteratively refines the response to a prompt: each model in the sequence builds on the previous model's output, producing more detailed, accurate, and coherent answers.
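The core loop can be sketched in a few lines of Python against Ollama's REST API. This is a minimal illustration rather than the project's actual implementation: the model names, the refinement instruction, and the use of `requests` are assumptions made for the example.

```python
import requests

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default generate endpoint

# Illustrative chain; in the project the models come from the hardware-aware selector.
MODEL_CHAIN = ["llama3.2:3b", "mistral:7b", "qwen2.5:7b"]

def refine(prompt: str) -> str:
    """Pass the prompt through each model; every stage improves the previous draft."""
    draft = ""
    for model in MODEL_CHAIN:
        stage_prompt = prompt if not draft else (
            f"Original question:\n{prompt}\n\n"
            f"Previous draft:\n{draft}\n\n"
            "Improve the draft: fix errors, add missing detail, keep it coherent."
        )
        resp = requests.post(
            OLLAMA_URL,
            json={"model": model, "prompt": stage_prompt, "stream": False},
            timeout=300,
        )
        resp.raise_for_status()
        draft = resp.json()["response"]
    return draft

print(refine("Explain how TCP congestion control works."))
```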
- Iterative Refinement: Uses multiple LLMs in sequence to progressively improve responses
- Hardware-Aware: Automatically selects models based on available RAM and use cases
- Domain Detection: Intelligently identifies the domain of the input prompt
- Optional Reasoning: Includes step-by-step reasoning for complex queries
- Streaming Responses: Real-time streaming of model outputs
- Docker Support: Easy deployment with Docker and Docker Compose
- Modern Web Interface: User-friendly frontend for interaction
- Docker and Docker Compose
- At least 8GB of RAM (for basic models)
- Ollama installed and running locally
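Before starting the stack, you can confirm that Ollama is reachable on its default port (11434) by listing the locally installed models. This quick check is a convenience for the example and not part of the project's tooling:

```python
import requests

# Ollama exposes the installed models at GET /api/tags on its default port.
resp = requests.get("http://localhost:11434/api/tags", timeout=5)
resp.raise_for_status()
print([m["name"] for m in resp.json()["models"]])
```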
- Clone the repository:
git clone https://github.com/dhruv1110/IterativeLLMRefiner.git
cd IterativeLLMRefiner
- Start the application using Docker Compose:
docker-compose up
- Open your browser and navigate to http://localhost:3000
The project consists of three main components:
- Frontend: React-based web interface for user interaction
- Backend: FastAPI server handling model interactions and processing
- Ollama Integration: Local LLM server for model inference
The system supports various models based on available RAM:
- 8GB: Basic models (3B-7B parameters)
- 16GB: Medium models (7B-14B parameters)
- 32GB: Large models (13B-32B parameters)
- 64GB: Extra large models (70B parameters)
- 128GB: Ultra large models (110B parameters)
Models are categorized by use case:
- General
- Research
- Reasoning
- Coding
- Vision
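A rough sketch of how a hardware-aware, use-case-based lookup might work is shown below. The tier table mirrors the RAM guidance above, but the specific model names, the use of psutil, and the selection logic are illustrative assumptions, not the project's actual code:

```python
import psutil  # assumption: psutil is used here only to read total system RAM

# Tier table mirroring the RAM guidance above; model names are placeholders.
RAM_TIERS = [
    (8,   {"general": "llama3.2:3b", "coding": "qwen2.5-coder:7b"}),   # basic, 3B-7B
    (16,  {"general": "qwen2.5:14b", "coding": "qwen2.5-coder:14b"}),  # medium, 7B-14B
    (32,  {"general": "qwen2.5:32b", "coding": "qwen2.5-coder:32b"}),  # large, 13B-32B
    (64,  {"general": "llama3.3:70b"}),                                # extra large, 70B
    (128, {"general": "qwen:110b"}),                                   # ultra large, 110B
]

def pick_model(use_case: str = "general") -> str:
    """Pick a model from the largest tier that fits the machine's RAM and use case."""
    ram_gb = psutil.virtual_memory().total / 2**30
    fitting = [models for min_gb, models in RAM_TIERS if ram_gb >= min_gb]
    for models in reversed(fitting):          # prefer the largest fitting tier
        if use_case in models:
            return models[use_case]
    return RAM_TIERS[0][1]["general"]         # fall back to the smallest general model

print(pick_model("coding"))
```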
- GET /models: List available models
- GET /pull-model: Pull a specific model
- POST /generate: Generate refined responses using the model chain
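Only the endpoint paths are documented above, so the example below makes some assumptions: that the backend is reachable on port 8000 (uvicorn's default) and that POST /generate accepts a JSON body with a prompt field. Treat it as a starting point and check the running instance's FastAPI docs for the exact schema:

```python
import requests

BACKEND_URL = "http://localhost:8000"  # assumption: uvicorn's default port

# List the models the backend currently knows about.
models = requests.get(f"{BACKEND_URL}/models", timeout=10).json()
print(models)

# Ask the model chain for a refined response.
# The request body shape ("prompt") is an assumption; consult the API docs for the real schema.
resp = requests.post(
    f"{BACKEND_URL}/generate",
    json={"prompt": "Summarise the trade-offs between REST and gRPC."},
    timeout=600,
)
resp.raise_for_status()
print(resp.json())
```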
- Create a virtual environment:
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
- Install dependencies:
cd backend
pip install -r requirements.txt
- Run the development server:
uvicorn main:app --reload
- Install dependencies:
cd frontend
npm install
- Start the development server:
npm start
Please read CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.
This project is licensed under the MIT License - see the LICENSE file for details.
- Ollama for providing the LLM infrastructure
- All the open-source LLM models used in this project
- The open-source community for their valuable contributions