
Python utility to convert multipage PDFs to plain text files via OCR


lecovi/ocrPDF2TXT


ocrPDF2TXT

Python utility to convert scanned multipage PDFs to plain text files via OCR.

Dependencies

  • Python 3
  • ImageMagick
  • Tesseract
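
Before running a conversion it can help to confirm that the external programs are installed. The snippet below is a minimal sketch, not part of this repository: the function name `missing_tools` is hypothetical, and it assumes ImageMagick is exposed as `convert` (ImageMagick 7 installs usually provide `magick` instead) and Tesseract as `tesseract`.

```python
import shutil


def missing_tools(tools=("convert", "tesseract")):
    """Return the names in `tools` that cannot be found on PATH.

    The default executable names are assumptions; on ImageMagick 7
    the entry point is typically `magick` rather than `convert`.
    """
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing dependencies: " + ", ".join(missing))
    else:
        print("All external dependencies found.")
```

An empty list means every listed executable was found on the current PATH.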

Advantages

It doesn't use Pillow, ReportLab, or Rufus ;-)
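
Avoiding Python imaging libraries means the work can be delegated to the ImageMagick and Tesseract command-line programs. The following is a rough sketch of one way to do that with only the standard library; it is not necessarily how this repository implements it. The function names, the `page-%04d.png` naming pattern, the 300 DPI default, and the `eng` language default are all assumptions made for illustration.

```python
import subprocess
import tempfile
from pathlib import Path


def rasterize_cmd(pdf_path, out_dir, dpi=300):
    """Build an ImageMagick command that renders each PDF page to a PNG."""
    pattern = str(Path(out_dir) / "page-%04d.png")
    # -density is placed before the input so it applies while the PDF is read.
    # Assumption: the executable is `convert`; ImageMagick 7 uses `magick`.
    return ["convert", "-density", str(dpi), str(pdf_path), pattern]


def ocr_cmd(image_path, lang="eng"):
    """Build a Tesseract command that prints the recognized text to stdout."""
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def pdf_to_text(pdf_path, txt_path, lang="eng", dpi=300):
    """Convert one scanned PDF into one plain text file (hypothetical helper)."""
    with tempfile.TemporaryDirectory() as tmp:
        subprocess.run(rasterize_cmd(pdf_path, tmp, dpi), check=True)
        texts = []
        # Zero-padded names keep the pages in order when sorted.
        for page in sorted(Path(tmp).glob("page-*.png")):
            result = subprocess.run(
                ocr_cmd(page, lang), check=True, capture_output=True, text=True
            )
            texts.append(result.stdout)
    Path(txt_path).write_text("\n".join(texts), encoding="utf-8")
```

Under these assumptions, `pdf_to_text("scan.pdf", "scan.txt")` would write the OCR output of every page into `scan.txt`. Keeping the command builders separate from the runner makes the command lines easy to inspect or adjust without invoking the external tools.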
