Image Search in Natural Language

This web application lets you search images with natural-language queries, powered by a CLIP Vision Transformer (ViT) model. It embeds and indexes the images you upload, then retrieves them by matching your text query against those embeddings.

Demo

Watch the demo video to see the application in action.

System Architecture

Core Components

AIPhotoGallery (Main Controller)

- Manages the entire image processing and search pipeline
- Initializes CLIP model on available device (CPU/CUDA)
- Handles index management and search operations

Components:

class AIPhotoGallery:
    def __init__(self):
        self.model = SentenceTransformer("clip-ViT-L-14")
        self.indexing_manager = IndexingManager()
        self.image_processor = ImageProcessor()

Image Processing System

- Handles image validation and deduplication
- Supports formats: .jpg, .jpeg, .png, .gif, .bmp, .webp
- MD5 hash-based duplicate detection
- Image verification using PIL

class ImageProcessor:
    SUPPORTED_FORMATS = {'.jpg', '.jpeg', '.png', '.gif', '.bmp', '.webp'}
    def process_image(self, image_path: Path):
        image = Image.open(image_path).convert("RGB")
        return self.model.encode(image)
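
The validation and deduplication steps listed above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual code: the function names `file_md5` and `is_new_image` and the in-memory `seen_hashes` set are assumptions introduced here.

```python
import hashlib
from pathlib import Path

# Same extensions as the SUPPORTED_FORMATS set above.
SUPPORTED_FORMATS = {".jpg", ".jpeg", ".png", ".gif", ".bmp", ".webp"}


def file_md5(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to bound memory."""
    digest = hashlib.md5()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_new_image(path: Path, seen_hashes: set) -> bool:
    """Accept a file only if its extension is supported and its content is unseen.

    `seen_hashes` is a hypothetical in-memory registry of digests; it is
    updated in place when a file is accepted.
    """
    if path.suffix.lower() not in SUPPORTED_FORMATS:
        return False
    digest = file_md5(path)
    if digest in seen_hashes:
        return False  # byte-identical duplicate of an earlier upload
    seen_hashes.add(digest)
    return True
```

A content check with PIL (for example `Image.open(path).verify()`) would typically follow the extension check; it is left out here to keep the sketch dependency-free.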

Indexing System

- FAISS-based vector similarity search
- Asynchronous background indexing
- Incremental index updates
- Index caching with 5-minute TTL

```
[Index Structure]
/Index/
├── vector.index      # FAISS vector index
└── vector.index.paths # Mapped image paths
```
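
To illustrate index caching with a 5-minute TTL, here is a small sketch of a time-based cache. The class name `TTLCache`, the `loader` callable, and the injectable `clock` are assumptions for illustration; in the application the loader would presumably read the FAISS index from `Index/vector.index`.

```python
import time
from typing import Any, Callable, Optional


class TTLCache:
    """Cache a single expensive-to-load value for a fixed number of seconds."""

    def __init__(self, loader: Callable[[], Any], ttl_seconds: float = 300.0,
                 clock: Callable[[], float] = time.monotonic):
        self._loader = loader      # hypothetical: e.g. reads the index from disk
        self._ttl = ttl_seconds    # 300 s matches the 5-minute TTL listed above
        self._clock = clock        # injectable so the expiry logic can be tested
        self._value: Any = None
        self._loaded_at: Optional[float] = None

    def get(self) -> Any:
        """Return the cached value, reloading it if missing or expired."""
        now = self._clock()
        if self._loaded_at is None or now - self._loaded_at >= self._ttl:
            self._value = self._loader()
            self._loaded_at = now
        return self._value

    def invalidate(self) -> None:
        """Force the next get() to reload, e.g. after an index update."""
        self._loaded_at = None
```

Calling `invalidate()` after a background indexing run is one way to keep searches from serving a stale index for up to five minutes.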

Data Flow

[Image Upload Flow]
1. Upload Request → Duplicate Check (MD5) → Save to /images/
2. Background Indexing:
   Image → CLIP Embedding → FAISS Index Update
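
The hand-off from the upload request to background indexing could look roughly like this. It is a sketch under assumptions: `BackgroundIndexer` and the `index_fn` callback are hypothetical names, and a `ThreadPoolExecutor` stands in for whatever executor the application actually uses.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from pathlib import Path
from typing import Callable, List


class BackgroundIndexer:
    """Run indexing jobs off the request path on a single worker thread."""

    def __init__(self, index_fn: Callable[[Path], None]):
        # index_fn is a placeholder for "embed the image and add it to the index".
        self._index_fn = index_fn
        # One worker serializes index updates, so jobs never write concurrently.
        self._executor = ThreadPoolExecutor(max_workers=1)

    def submit(self, saved_paths: List[Path]) -> Future:
        """Schedule indexing for newly saved images and return immediately."""
        return self._executor.submit(self._index_all, list(saved_paths))

    def _index_all(self, paths: List[Path]) -> int:
        """Index each path in order and report how many were processed."""
        for path in paths:
            self._index_fn(path)
        return len(paths)

    def shutdown(self) -> None:
        """Wait for queued jobs to finish and release the worker thread."""
        self._executor.shutdown(wait=True)
```

The upload handler can return its status as soon as `submit` has been called; the returned `Future` is only needed if a caller wants to wait for indexing to finish.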

[Search Flow]
1. Text Query → CLIP Text Embedding
2. FAISS Similarity Search → Top K Similar Images
3. Dynamic HTML Gallery Generation
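
The search flow can be illustrated without any ML dependencies. In the sketch below a brute-force cosine search stands in for FAISS, and the vectors are placeholders for CLIP embeddings; `normalize`, `top_k`, and their parameters are hypothetical names, not part of the project.

```python
import heapq
import math
from typing import List, Sequence, Tuple


def normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def top_k(query: Sequence[float], vectors: Sequence[Sequence[float]],
          paths: Sequence[str], k: int = 5) -> List[Tuple[str, float]]:
    """Return the k image paths whose vectors are most similar to the query.

    vectors[i] is assumed to be the embedding of paths[i], mirroring the
    index-plus-path-mapping layout of the Index/ directory.
    """
    if len(vectors) != len(paths):
        raise ValueError("vectors and paths must have the same length")
    q = normalize(query)
    scored = (
        (sum(a * b for a, b in zip(q, normalize(v))), path)
        for v, path in zip(vectors, paths)
    )
    best = heapq.nlargest(k, scored)
    return [(path, score) for score, path in best]
```

A real index avoids scanning every vector in Python, but the contract is the same: a query embedding goes in, and the best-scoring image paths come out.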

Directory Organization

/
├── app/
│   ├── models/
│   │   ├── gallery.py     # Main controller
│   │   └── indexing.py    # Index management
│   ├── utils/
│   │   ├── image_processor.py  # Image handling
│   │   └── search.py      # FAISS operations
│   ├── routes.py          # FastAPI endpoints
│   └── __init__.py        # App initialization
├── images/                # Image storage
│   └── [uploaded images]
├── Index/                 # FAISS indexes
│   ├── vector.index
│   └── vector.index.paths
└── templates/             # Jinja2 templates

API Implementation

@app.post("/upload")
async def upload(files: List[UploadFile]):
    # 1. Validate and save images
    # 2. Trigger background indexing
    # 3. Return upload status
    ...

@app.post("/search")
async def search(query: SearchQuery):
    # 1. Process text query
    # 2. Perform FAISS search
    # 3. Generate gallery HTML
    ...

Technical Stack Details

ML Framework

- CLIP (ViT-L-14) for text-image embeddings
- PyTorch backend with CUDA support
- FAISS for efficient similarity search

import torch
from sentence_transformers import SentenceTransformer

device = "cuda" if torch.cuda.is_available() else "cpu"
model = SentenceTransformer("clip-ViT-L-14", device=device)

Web Framework

- FastAPI with async support
- Jinja2 templating
- Static file serving
- Multipart file upload handling

Storage System

- File-based image storage
- FAISS vector index
- Path mapping for image retrieval
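
Path mapping and incremental updates fit together: each row of the vector index corresponds to one entry in the path list, so only images missing from that list need embedding. The sketch below assumes, purely for illustration, that the mapping is a text file with one path per line; the actual format of `vector.index.paths` is not specified in this README. `load_paths` and `find_unindexed` are hypothetical helpers.

```python
from pathlib import Path
from typing import List

SUPPORTED_FORMATS = {".jpg", ".jpeg", ".png", ".gif", ".bmp", ".webp"}


def load_paths(paths_file: Path) -> List[str]:
    """Read the path mapping; line i is assumed to belong to index row i."""
    if not paths_file.exists():
        return []
    text = paths_file.read_text(encoding="utf-8")
    return [line for line in text.splitlines() if line]


def find_unindexed(image_dir: Path, indexed: List[str]) -> List[str]:
    """List supported images in image_dir that are not yet in the mapping."""
    known = set(indexed)
    candidates = sorted(
        str(p) for p in image_dir.iterdir()
        if p.is_file() and p.suffix.lower() in SUPPORTED_FORMATS
    )
    return [p for p in candidates if p not in known]
```

Appending the new embeddings and the new paths in the same order keeps row numbers and path entries aligned.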

Performance Features

Efficient Image Processing

- Duplicate detection before processing
- Async background indexing
- Incremental index updates

Search Optimization

- FAISS index caching
- Lazy loading of gallery images
- Async search operations

Resource Management

- CUDA acceleration when available
- Background task executor
- Memory-efficient index loading
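
As one possible reading of "lazy loading of gallery images", the generated gallery can rely on the browser's native `loading="lazy"` attribute. The function `render_gallery` and the `/images/` URL prefix are assumptions for illustration; the project itself renders through Jinja2 templates.

```python
from html import escape
from typing import Iterable, Tuple
from urllib.parse import quote


def render_gallery(results: Iterable[Tuple[str, float]],
                   url_prefix: str = "/images/") -> str:
    """Build a minimal HTML gallery from (filename, score) search results."""
    items = []
    for filename, score in results:
        # Percent-encode the filename for the URL, then HTML-escape both attributes.
        src = escape(url_prefix + quote(filename), quote=True)
        alt = escape(filename, quote=True)
        # loading="lazy" lets the browser defer fetching off-screen images.
        items.append(
            f'<figure><img src="{src}" alt="{alt}" loading="lazy">'
            f"<figcaption>{score:.3f}</figcaption></figure>"
        )
    return '<div class="gallery">' + "".join(items) + "</div>"
```

With this approach the server still returns every result in one response, but the browser downloads image files only as the user scrolls toward them.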

Setup and Running

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python run.py
```

The server listens on http://0.0.0.0:3000 (all network interfaces, so it is reachable locally at http://localhost:3000) with hot reload enabled for development.

asiff00/Image-Search-in-Natural-Language
