4,531 Downloads Updated yesterday
DeepSeek-OCR requires Ollama v0.13.0 or later.
DeepSeek-OCR is a vision-language model that can perform token-efficient optical character recognition (OCR).
Please note, the model is sensitive to its input. For example, a missing punctuation or new line may cause an improper output.
ollama run deepseek-ocr "/path/to/image\n<|grounding|>Given the layout of the image."
ollama run deepseek-ocr "/path/to/image\nFree OCR."
ollama run deepseek-ocr "/path/to/image\nParse the figure."
ollama run deepseek-ocr "/path/to/image\nExtract the text in the image."
ollama run deepseek-ocr "/path/to/image\n<|grounding|>Convert the document to markdown."