Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which is the ocr engine that is opted by spacy-layout #25

Open
Zaheer-10 opened this issue Jan 8, 2025 · 1 comment
Open

Which is the ocr engine that is opted by spacy-layout #25

Zaheer-10 opened this issue Jan 8, 2025 · 1 comment
Labels
docling Related to Docling library and models

Comments

@Zaheer-10
Copy link

No description provided.

@ines
Copy link
Member

ines commented Feb 23, 2025

spacy-layout uses Docling under the hood, which also takes care of the OCR. Also see the technical report paper for more details:

Docling provides optional support for OCR, for example to cover scanned PDFs or content in bitmaps images embedded on a page. In our initial release, we rely on EasyOCR [1], a popular thirdparty OCR library with support for many languages. Docling, by default, feeds a high-resolution page image (216 dpi) to the OCR engine, to allow capturing small print detail in decent quality. While EasyOCR delivers reasonable transcription quality, we observe that it runs fairly slow on CPU (upwards of 30 seconds per page).

We are actively seeking collaboration from the open-source community to extend Docling with
additional OCR backends and speed improvements.

@ines ines added the docling Related to Docling library and models label Feb 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docling Related to Docling library and models
Projects
None yet
Development

No branches or pull requests

2 participants