DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
88.4M Pulls 35 Tags Updated 11 months ago
DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a large context window and three reasoning modes.
137.5K Pulls 1 Tag Updated 1 month ago
DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
2.2M Pulls 1 Tag Updated 6 months ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
1.2M Pulls 5 Tags Updated 1 year ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
1.1M Pulls 15 Tags Updated 1 year ago
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity.
406.7K Pulls 9 Tags Updated 1 year ago
DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
475.4K Pulls 3 Tags Updated 7 months ago
28 Pulls 1 Tag Updated 3 days ago
A Deepseek-R1:8b model with Deepseek-R1:1.5b model as drafting model
104 Pulls 1 Tag Updated 2 weeks ago
A Deepseek-r1:7b Model with Deepseek-r1:8b temp and TopP, while having Deepseek-r1:1.5b as a speculative draft model
81 Pulls 1 Tag Updated 2 weeks ago
62 Pulls 1 Tag Updated 2 weeks ago
42 Pulls 1 Tag Updated 3 weeks ago
I have just enabled both calling and thinking to existing deepseek-r1 models.
1,814 Pulls 6 Tags Updated 2 months ago
1,816 Pulls 2 Tags Updated 2 months ago
ollama run f0rc3ps/deepseek-r1-32b-uncensored:nu11secur1ty
1,181 Pulls 1 Tag Updated 3 months ago
DeepSeek-R1-0528-Qwen3-8B
709 Pulls 1 Tag Updated 1 month ago
822 Pulls 1 Tag Updated 3 months ago
601 Pulls 12 Tags Updated 4 months ago
Huggingface link - https://huggingface.co/iradukunda-dev/law-finetuned-DeepSeek-R1-Distill-Qwen-7B
465 Pulls 1 Tag Updated 5 months ago
SmallCoder is a compact reasoning-focused coding model, fine-tuned from DeepSeek-R1 1.5B using a code dataset that includes step-by-step reasoning.
292 Pulls 1 Tag Updated 4 months ago