
Fine-tuning the LLaMA 3.2-1B-Instruct model with LoRA and QLoRA parameter-efficient fine-tuning (PEFT) methods


EffiLLaMA

Overview

This project demonstrates how to fine-tune the LLaMA 3.2-1B Instruct model using text extracted from the Harry Potter book series. The training was conducted using LoRA (Low-Rank Adaptation) and QLoRA techniques for parameter-efficient fine-tuning. The goal is to create a custom model that generates Harry Potter-themed text and understands the context specific to the book series.

Key Features

  1. Extracted text from all Harry Potter books and chunked it for training.
  2. Fine-tuned LLaMA 3.2-1B using LoRA and QLoRA for causal language modeling.
  3. Efficient parameter tuning with reduced memory requirements.
  4. Saved fine-tuned model weights for inference or further fine-tuning.

What is LoRA?

LoRA (Low-Rank Adaptation) is a technique designed to make fine-tuning large language models more efficient. Instead of updating all model parameters during fine-tuning, LoRA introduces additional low-rank trainable matrices into specific layers of the model, significantly reducing the number of parameters that need to be updated.
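As a toy illustration (not this project's code), the idea can be written in a few lines of PyTorch: a frozen weight matrix is augmented with a trainable low-rank product B·A, so only r·(d_in + d_out) parameters are updated instead of d_in·d_out. All names here are illustrative.

import torch
import torch.nn as nn

class ToyLoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update (illustrative sketch only)."""
    def __init__(self, d_in, d_out, r=16, alpha=32):
        super().__init__()
        self.base = nn.Linear(d_in, d_out, bias=False)
        self.base.weight.requires_grad = False                # original weights stay frozen
        self.A = nn.Parameter(torch.randn(r, d_in) * 0.01)    # low-rank factor A (r x d_in)
        self.B = nn.Parameter(torch.zeros(d_out, r))          # low-rank factor B (d_out x r), zero-initialized
        self.scale = alpha / r                                # scaling factor (lora_alpha / r)

    def forward(self, x):
        # frozen path + scaled low-rank update
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

For example, with d_in = d_out = 2048 and r = 16, the adapter adds roughly 65K trainable parameters per layer while the ~4.2M original weights stay frozen.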

Benefits of LoRA:

  1. Significantly reduces memory usage.
  2. Faster fine-tuning on large-scale models.
  3. Maintains high performance with fewer trainable parameters.

For a deeper understanding, refer to the LoRA paper:

LoRA: Low-Rank Adaptation of Large Language Models by Edward J. Hu et al.

What is QLoRA?

QLoRA (Quantized LoRA) builds upon the LoRA framework by using quantization techniques. It leverages 4-bit quantization for the model weights to further reduce memory usage while maintaining the ability to fine-tune efficiently.

QLoRA uses:

  1. 4-bit quantized base models for reduced memory consumption.
  2. Parameter-efficient fine-tuning to adapt the model to new tasks.

Key Advantages:

  1. Allows training of large-scale models on a single GPU.
  2. Reduces the computational footprint without sacrificing performance.
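A minimal sketch of loading a 4-bit quantized base model for QLoRA-style fine-tuning, assuming the bitsandbytes integration in transformers and the peft library are installed; the exact settings used by this project may differ.

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import prepare_model_for_kbit_training

# NF4 4-bit quantization with fp16 compute, as described in the QLoRA paper
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_use_double_quant=True,
)

base_model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B-Instruct",   # assumed base model id
    quantization_config=bnb_config,
    device_map="auto",
)
base_model = prepare_model_for_kbit_training(base_model)  # prepare the quantized model for PEFT training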

For more details, check the QLoRA paper:

QLoRA: Efficient Finetuning of Quantized LLMs by Tim Dettmers et al.

Setup Instructions

Step 1: Clone the Repository

git clone https://github.com/AnanthaPadmanaban-KrishnaKumar/EffiLLaMA.git
cd EffiLLaMA

Step 2: Set Up a Virtual Environment (Recommended)

python -m venv env
source env/bin/activate   # On Linux/macOS
env\Scripts\activate      # On Windows

Step 3: Install Dependencies

pip install -r requirements.txt

Dataset Preparation

The dataset is based on the text extracted from the Harry Potter book series. The preprocessing steps included:

  1. Loading PDF files using PyPDFDirectoryLoader.
  2. Splitting text into chunks using RecursiveCharacterTextSplitter with a chunk size of 1500 tokens and a chunk overlap of 50 tokens (see the sketch after this list).
  3. Normalizing the text (e.g., removing unnecessary characters and newlines).
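A minimal sketch of that preprocessing pipeline, assuming the LangChain community loaders are installed; the directory path is an assumption, and the actual preprocessing lives in train.py.

import re
from langchain_community.document_loaders import PyPDFDirectoryLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter

# 1. Load every PDF in the books directory (path is an assumption)
docs = PyPDFDirectoryLoader("data/harry_potter_books").load()

# 2. Split into overlapping chunks
splitter = RecursiveCharacterTextSplitter(chunk_size=1500, chunk_overlap=50)
chunks = splitter.split_documents(docs)

# 3. Normalize whitespace/newlines and collect into the dataset format shown below
dataset = [{"text": re.sub(r"\s+", " ", c.page_content).strip()} for c in chunks]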

Dataset Format

The resulting dataset is stored as a list of dictionaries with the format:

[
  {
    "text": "Harry Potter and the Philosopher's Stone begins with..."
  },
  ...
]
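For training, this list of dictionaries can be converted into a Hugging Face Dataset and tokenized. The snippet below is a sketch, not necessarily the exact code in train.py; the max_length value is an assumption, and `dataset` refers to the list of dictionaries shown above.

from datasets import Dataset
from transformers import AutoTokenizer

# `dataset` is the list of dictionaries shown above
hf_dataset = Dataset.from_list(dataset)

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")
tokenizer.pad_token = tokenizer.eos_token   # LLaMA tokenizers ship without a pad token

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = hf_dataset.map(tokenize, batched=True, remove_columns=["text"])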

Training the Model

The train.py script implements the following:

  1. Data Preprocessing: Loads, chunks, and normalizes the text.
  2. Dataset Preparation: Converts text into a Hugging Face Dataset for training.
  3. Model Initialization: Loads the LLaMA 3.2-1B Instruct model and tokenizer.
  4. LoRA Configuration: Applies parameter-efficient tuning with LoRA.
  5. Training: Fine-tunes the model using Trainer with mixed precision (fp16).

Run Training

python train.py

LoRA Configuration

lora_config = LoraConfig(
    r=16,                        
    lora_alpha=32,               
    target_modules=[             
        "q_proj", "v_proj", 
        "k_proj", "o_proj", 
        "gate_proj", "up_proj", 
        "down_proj"
    ],
    lora_dropout=0.1,            
    bias="none",                 
    task_type="CAUSAL_LM"        
)

Key Parameters:

  1. r: Low-rank dimension for decomposition matrices.
  2. lora_alpha: Scaling factor for LoRA outputs.
  3. target_modules: Specifies which layers to adapt with LoRA.
  4. lora_dropout: Dropout rate for regularization.
  5. task_type: Task type set to CAUSAL_LM (causal language modeling).
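A sketch of how such a config is typically applied and trained with the Hugging Face Trainer using fp16 mixed precision. It assumes the quantized base_model, tokenizer, lora_config, and tokenized dataset from the sketches above are already in memory; the hyperparameters are placeholders, not the project's exact values.

from peft import get_peft_model
from transformers import Trainer, TrainingArguments, DataCollatorForLanguageModeling

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()   # only the LoRA matrices are trainable

training_args = TrainingArguments(
    output_dir="checkpoints",        # checkpoint directory (assumed)
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
    learning_rate=2e-4,
    fp16=True,                       # mixed-precision training
    logging_steps=10,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),  # causal LM labels
)
trainer.train()
trainer.save_model("final_model")    # matches the output directory described below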

Training Output

  1. The fine-tuned model is saved in the final_model directory.
  2. Logs and checkpoints are saved during training for monitoring progress.

Inference

The fine-tuned model is hosted on Hugging Face at AIAlbus/EffiLLaMA.

Example Usage

Here’s how to use the model for inference:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

# Load tokenizer and model
model_name = "AIAlbus/EffiLLaMA"
tokenizer = AutoTokenizer.from_pretrained(model_name)
base_model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", torch_dtype=torch.float16)
model = PeftModel.from_pretrained(base_model, model_name)

# Prepare input
input_text = "Why did Harry Potter survive Voldemort's attack?"
inputs = tokenizer(input_text, return_tensors="pt", padding=True, truncation=True).to(base_model.device)

# Generate response (passing the full inputs also supplies the attention mask)
output = model.generate(**inputs, max_length=150, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))

Acknowledgments

  1. Hugging Face Transformers for tools to load and fine-tune the model.
  2. LangChain for efficient text preprocessing.
  3. Research on LoRA and QLoRA for parameter-efficient fine-tuning methods.

License

This project is licensed under the MIT License. See the LICENSE file for details.
