DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
346.9K Pulls 3 Tags Updated 3 months ago
DeepSeek-V3.2 is a model that harmonizes high computational efficiency with superior reasoning and agent performance.
52.9K Pulls 1 Tag Updated 3 months ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
472.7K Pulls 8 Tags Updated 5 months ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models such as o3 and Gemini 2.5 Pro.
80.2M Pulls 35 Tags Updated 8 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
3.4M Pulls 102 Tags Updated 2 years ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
901.9K Pulls 15 Tags Updated 11 months ago
A version of the DeepSeek-R1 model that has been post-trained by Perplexity to provide unbiased, accurate, and factual information.
299.2K Pulls 9 Tags Updated 1 year ago
An upgraded version of DeepSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
193.6K Pulls 7 Tags Updated 1 year ago
A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.
3.7M Pulls 5 Tags Updated 1 year ago
Based on DeepSeek-R1, because OpenCode checks the registry to verify tool compatibility.
91 Pulls 1 Tag Updated 1 week ago
31 Pulls 2 Tags Updated 3 weeks ago
NovaForge AI – DeepSeek Coder 6.7B Pro is a professional-grade coding AI built for production-level development.
1,271 Pulls 1 Tag Updated 2 months ago
Hugging Face link: https://huggingface.co/iradukunda-dev/law-finetuned-DeepSeek-R1-Distill-Qwen-7B
226 Pulls 1 Tag Updated 2 months ago
SmallCoder is a compact reasoning-focused coding model, fine-tuned from DeepSeek-R1 1.5B using a code dataset that includes step-by-step reasoning.
128 Pulls 1 Tag Updated 1 month ago
A lightweight Chinese chat model fine-tuned from DeepSeek-R1-Distill-Qwen-1.5B, with a built-in catgirl speech style and an affectionate tone.
263 Pulls 1 Tag Updated 5 months ago
SmallCoder is a compact reasoning-focused math model, fine-tuned from DeepSeek-R1 1.5B using a math dataset that includes step-by-step reasoning.
7 Pulls 1 Tag Updated 3 weeks ago
This is a brand-new Mixture-of-Experts (MoE) model from DeepSeek, specializing in coding instructions (quantized to IQ4_XS).
4,145 Pulls 3 Tags Updated 2 months ago
DeepSeek-R1-0528-Qwen3-8B-IQ4_NL
3,480 Pulls 1 Tag Updated 9 months ago
DeepSeek R1 0528 Qwen3 8B with tool calling/MCP support
2,531 Pulls 1 Tag Updated 8 months ago
DeepSeek R1 0528 Qwen3 8B Q4 with tool calling
1,610 Pulls 1 Tag Updated 9 months ago