deepseek-ocr

deepseek-ocr:latest

184.1K Downloads Updated 3 months ago

DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

vision 3b

ollama run deepseek-ocr

curl http://localhost:11434/api/chat \
  -d '{
    "model": "deepseek-ocr",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='deepseek-ocr',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'deepseek-ocr',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 3 months ago

3 months ago

0e7b018b8a22 · 6.7GB ·

model

archdeepseekocr

parameters3.34B

quantizationF16

6.7GB

license

MIT License Copyright (c) [year] [fullname] Permission is hereby granted, free of charge, to any per

1.1kB

params

{ "temperature": 0 }

18B

Readme

DeepSeek-OCR requires Ollama v0.13.0 or later.

DeepSeek-OCR is a vision-language model that can perform token-efficient optical character recognition (OCR).

Example inputs

Please note, the model is sensitive to its input. For example, a missing punctuation or new line may cause an improper output.

ollama run deepseek-ocr "/path/to/image\n<|grounding|>Given the layout of the image."

ollama run deepseek-ocr "/path/to/image\nFree OCR."

ollama run deepseek-ocr "/path/to/image\nParse the figure."

ollama run deepseek-ocr "/path/to/image\nExtract the text in the image."

ollama run deepseek-ocr "/path/to/image\n<|grounding|>Convert the document to markdown."

Examples

References

Arxiv paper

> DeepSeek-OCR requires [Ollama v0.13.0](https://github.com/ollama/ollama/releases) or later.

DeepSeek-OCR is a vision-language model that can perform token-efficient optical character recognition (OCR).

![fig1.png](/assets/library/deepseek-ocr/e93c9353-3836-4680-a7f1-148ad3e47eff)

### Example inputs

Please note, the model is sensitive to its input. For example, a missing punctuation or new line may cause an improper output.

```
ollama run deepseek-ocr "/path/to/image\n<|grounding|>Given the layout of the image."
```

```
ollama run deepseek-ocr "/path/to/image\nFree OCR."
```

```
ollama run deepseek-ocr "/path/to/image\nParse the figure."
```

```
ollama run deepseek-ocr "/path/to/image\nExtract the text in the image."
```

```
ollama run deepseek-ocr "/path/to/image\n<|grounding|>Convert the document to markdown."
```

### Examples

![show1.jpg](/assets/library/deepseek-ocr/445a87aa-b34e-4a85-8aba-921dd54cfd86)

![show2.jpg](/assets/library/deepseek-ocr/78c11fd6-d1be-4983-a08a-5188c65fcd1f)

![show3.jpg](/assets/library/deepseek-ocr/44b2ac4a-d52e-4f77-843d-3c828e7ecaf0)

![show4.jpg](/assets/library/deepseek-ocr/39e4d41b-634e-407a-a26e-7880c0a2b137)

### References

- [Arxiv paper](https://arxiv.org/abs/2510.18234)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)