Issues: pdf2htmlEX/pdf2htmlEX
Issues list
Anyone working on integrating this pdf2htmlEx with Mistral OCR or other LLM parsing?
#195 opened Apr 7, 2025 by dominikusbrian
How to control the size of html as large as the original pdf while saving the accuracy of the image
#194 opened Apr 3, 2025 by Cguanqin
Generated html page not working correctly with "Text fragments" in URL for multi word fragments
#184 opened Nov 6, 2024 by jmozmoz
pdf2htmlEX parses all the images on the page into single image with white background
#183 opened Oct 25, 2024 by dstepanenko
Why is some of the text not extracted and is baked into the generated images?
#166 opened Mar 19, 2024 by isaacfink