Optical Character Recognition (OCR) Service

This repository contains a Flask-based REST API for performing Optical Character Recognition (OCR) on uploaded images. It utilizes the Flask-RESTx library to define the API endpoints and manage request and response formats.

Getting Started

Prerequisites

Python (>=3.6)
Install required packages using pip install -r requirements.txt

Running the API

Clone this repository: git clone https://github.com/idan1356/flask_OCR_Service.git
Navigate to the project directory: cd ocr-service
Run the application: python app.py

The API will be accessible at http://localhost:5000/api/decipher-text.

API Endpoints

The API documentation provides detailed information about the available endpoints, request and response formats, and example usage. You can access the documentation by visiting /api/docs in your browser.

Upload Image for OCR

Endpoint: POST /api/decipher-text

This endpoint allows you to upload an image for Optical Character Recognition.

Parameters:

user_image: The image file to be processed.
langs: Comma-separated list of languages in the image (see language list in config file).
get_processed_image: Whether the processed image (rectangles around sentences found by the model) should be returned.

Responses:

200 OK: OCR successful. Returns OCR results and optional processed image.
400 Bad Request: OCR unsuccessful. Returns an error message.

Project Structure

The project is organized as follows:

app.py: The main Flask application file.
api/: Contains the API definition and resources.
api/components/: Contains parser and data model definitions.
api/utils/: Contains utility functions for processing images and handling API logic.

OCR Model Caching

The OCR model used by the service is LRU (Least Recently Used) cached to improve performance. The caching is implemented using the functools.lru_cache decorator, to cache OCR models based on the languages used for processing.

Configuration

You can configure languages and other settings in the config file.

Contributing

Contributions are welcome! Please open issues or submit pull requests for bug fixes, features, or improvements.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
api		api
model		model
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
config.json		config.json
readme.png		readme.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optical Character Recognition (OCR) Service

Getting Started

Prerequisites

Running the API

API Endpoints

Upload Image for OCR

Project Structure

OCR Model Caching

Configuration

Contributing

License

About

Releases

Packages

Languages

License

idan1356/flask_OCR_Service

Folders and files

Latest commit

History

Repository files navigation

Optical Character Recognition (OCR) Service

Getting Started

Prerequisites

Running the API

API Endpoints

Upload Image for OCR

Project Structure

OCR Model Caching

Configuration

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages