LLMOCR uses a local LLM to read text from images.
You can also change the instruction prompt to have the LLM use the image in any way you describe.
- Local Processing: All processing is done locally on your machine.
- User-Friendly GUI: Includes a graphical interface. All AI functionality is provided by KoboldCpp, a single executable.
- GPU Acceleration: Uses Apple Metal, NVIDIA CUDA, or AMD (Vulkan) hardware when available to greatly speed up inference.
- Cross-Platform: Supports Windows, macOS ARM, and Linux.
- Python 3.8 or higher
- Clone the repository
- Install Python for Windows
- Open KoboldCpp or an OpenAI-compatible API and load a vision model
- Open `llmocr.bat`
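The KoboldCpp instance loaded in the steps above exposes an OpenAI-compatible endpoint, so you can check that the vision model is responding before opening the GUI. A minimal sketch, assuming KoboldCpp's default port 5001, the standard `/v1/chat/completions` route, and OpenAI-style `image_url` message content (verify all of these against your setup; `scan.png` is a placeholder image name):

```shell
# Base64-encode an image and send an OpenAI-style vision request.
# Port 5001 is KoboldCpp's default; adjust if you launched with another.
# (On macOS, use plain `base64` without `-w 0`.)
IMG_B64=$(base64 -w 0 scan.png)

curl -s http://localhost:5001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d "$(cat <<EOF
{
  "max_tokens": 512,
  "messages": [
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "Read all of the text in this image."},
        {"type": "image_url",
         "image_url": {"url": "data:image/png;base64,${IMG_B64}"}}
      ]
    }
  ]
}
EOF
)"
```

If the server replies with a chat completion rather than an error, the vision model is loaded and reachable.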
- Clone the repository or download and extract the ZIP file
- Install Python 3.8 or higher if not already installed
- Create a new Python environment and install the packages in `requirements.txt`
- Open KoboldCpp or an OpenAI-compatible API with a loaded vision model
- Run `llmocr.py` using Python