60 Downloads Updated 2 months ago
ollama run HSR-projects/OpenScan-1.0
d97b1f4cb961 · 4.7GB
A lightweight, local-first AI-powered OCR system for extracting and structuring text from images.
OpenScan-1.0 is a modern OCR pipeline that combines vision models and language models to extract text from images and convert it into clean, structured output.
Unlike traditional OCR engines, OpenScan focuses on:
Image → Vision Model → Raw Text → AI Cleanup → Structured Output
Vision Model (BakLLaVA / LLaVA) → Extracts raw text from images
OpenScan Model → Cleans, corrects, and structures the text
Optional Post-processing → Converts into formats like JSON, Markdown, or plain text
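The three stages above can be sketched as a small Python pipeline. This is a minimal illustration under stated assumptions, not the project's actual implementation: `run_pipeline`, `to_json`, `to_markdown`, and `FORMATTERS` are hypothetical names, the output shapes are one possible choice, and the vision and cleanup stages are passed in as plain callables so the flow can be followed without a running model.

```python
import json


def to_json(text):
    """Post-processing: wrap the non-empty lines of cleaned text in a JSON object."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    return json.dumps({"lines": lines}, ensure_ascii=False)


def to_markdown(text):
    """Post-processing: render each non-empty line as a Markdown bullet."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    return "\n".join(f"- {line}" for line in lines)


# Hypothetical registry of output formats; "text" returns the cleaned text as-is.
FORMATTERS = {
    "json": to_json,
    "markdown": to_markdown,
    "text": lambda text: text.strip(),
}


def run_pipeline(image_path, extract, cleanup, output_format="text"):
    """Image -> vision model -> raw text -> AI cleanup -> structured output."""
    raw_text = extract(image_path)   # stage 1: vision model (e.g. BakLLaVA / LLaVA)
    cleaned = cleanup(raw_text)      # stage 2: OpenScan model
    return FORMATTERS[output_format](cleaned)  # stage 3: optional post-processing
```

Keeping the stages as injected callables means the same function works whether the models are reached through the Ollama CLI, its HTTP API, or stubbed out for testing.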
Install Ollama: https://ollama.com
ollama pull bakllava
ollama pull llava:7b
Create a Modelfile:
FROM llava:7b
SYSTEM "You are OpenScan-1.0, an AI OCR assistant. Extract text from images accurately, fix OCR errors, and return clean, structured output."
PARAMETER temperature 0.2
PARAMETER top_p 0.9
Build the model:
ollama create OpenScan-1.0 -f Modelfile
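The same system prompt and sampling parameters can also be supplied per request through Ollama's REST API, which may be handy for trying values before baking them into a Modelfile. The sketch below only builds the JSON body for `POST /api/generate` and sends nothing; `build_request` and `SYSTEM_PROMPT` are hypothetical names introduced for illustration.

```python
import json

SYSTEM_PROMPT = (
    "You are OpenScan-1.0, an AI OCR assistant. Extract text from images "
    "accurately, fix OCR errors, and return clean, structured output."
)


def build_request(prompt, model="llava:7b"):
    """Build a /api/generate body mirroring the Modelfile's SYSTEM and PARAMETER lines."""
    return {
        "model": model,
        "prompt": prompt,
        "system": SYSTEM_PROMPT,                        # same role as SYSTEM
        "options": {"temperature": 0.2, "top_p": 0.9},  # same role as PARAMETER
        "stream": False,                                # one JSON reply, not a stream
    }


if __name__ == "__main__":
    print(json.dumps(build_request("Clean and structure this OCR output."), indent=2))
```

A low temperature such as 0.2 keeps sampling close to the most likely tokens, which generally suits transcription-style tasks better than creative ones.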
ollama run bakllava
Prompt:
Extract all visible text from this image exactly as written.
Do not explain anything.
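For batch use, the extraction step can be scripted instead of typed into the interactive prompt. The following is a sketch, assuming Ollama is serving its HTTP API at the default local address `http://localhost:11434`; `build_extract_payload` and `extract_text` are hypothetical helper names, and the image is sent base64-encoded in the `images` field of `/api/generate`.

```python
import base64
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local address
EXTRACT_PROMPT = (
    "Extract all visible text from this image exactly as written.\n"
    "Do not explain anything."
)


def build_extract_payload(image_bytes, model="bakllava"):
    """Build the request body: the prompt plus the image as a base64 string."""
    return {
        "model": model,
        "prompt": EXTRACT_PROMPT,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for a single JSON object instead of a stream
    }


def extract_text(image_path, model="bakllava"):
    """Send one image to the vision model and return the raw text it reports."""
    with open(image_path, "rb") as f:
        payload = build_extract_payload(f.read(), model)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `extract_text("scan.png")` (with a hypothetical file name) would return the model's raw transcription, which can then be handed to the cleanup step.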
ollama run OpenScan-1.0
Prompt:
Clean and structure this OCR output.
Fix errors and return readable text.
Input:
H3llo W0rld!!
Th1s 1s @ t3st.
Output:
Hello World!!
This is a test.
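The cleanup step can be scripted in a similar way. This sketch assumes the `ollama` CLI is on the `PATH` and that the model was built locally under the name `OpenScan-1.0`; it passes the prompt as an argument to `ollama run` so the command exits after one reply. `build_cleanup_prompt` and `clean_text` are hypothetical helper names.

```python
import subprocess

CLEANUP_INSTRUCTION = (
    "Clean and structure this OCR output.\n"
    "Fix errors and return readable text."
)


def build_cleanup_prompt(raw_text):
    """Join the cleanup instruction and the raw OCR text into one prompt."""
    return f"{CLEANUP_INSTRUCTION}\n\n{raw_text.strip()}"


def clean_text(raw_text, model="OpenScan-1.0"):
    """Run the model once via `ollama run MODEL PROMPT` and return its reply."""
    result = subprocess.run(
        ["ollama", "run", model, build_cleanup_prompt(raw_text)],
        capture_output=True,  # collect stdout/stderr instead of printing them
        text=True,            # decode output as str
        check=True,           # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()
```

Because model output is not deterministic, the reply for a given input may differ between runs, so treat the sample above as indicative rather than guaranteed.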
Contributions are welcome.
This project relies on third-party models. Make sure your use complies with each model's license.
HSR Projects
OpenScan-1.0 is an experimental OCR system. Results may vary depending on input quality and model limitations.
Not just OCR — intelligent text understanding.