Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.8M Pulls 5 Tags Updated 1 year ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
2M Pulls 53 Tags Updated 2 years ago
24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
865.8K Pulls 6 Tags Updated 6 months ago
123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
237K Pulls 6 Tags Updated 6 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
4.3M Pulls 102 Tags Updated 2 years ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
1.2M Pulls 5 Tags Updated 1 year ago
Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excels at coding tasks. Created by Eric Hartford.
1.8M Pulls 70 Tags Updated 1 year ago
An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
277.3K Pulls 7 Tags Updated 1 year ago
DBRX is an open, general-purpose LLM created by Databricks.
311.4K Pulls 7 Tags Updated 2 years ago
DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.
120.7K Pulls 1 Tag Updated 1 month ago
47 Pulls 4 Tags Updated 1 week ago
DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
2.2M Pulls 1 Tag Updated 5 months ago
11 Pulls 9 Tags Updated 1 week ago
DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
469.7K Pulls 3 Tags Updated 6 months ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
699.1K Pulls 8 Tags Updated 8 months ago
Testing new conversion patterns...
322 Pulls 36 Tags Updated 1 week ago
196 Pulls 3 Tags Updated 1 month ago
The next generation of light models built to play Minecraft! Inspired by Andy-4
138 Pulls 2 Tags Updated 3 months ago
30 Pulls 6 Tags Updated 1 week ago
MyJobs‑aware assistant model tuned for the MyJobs repo and architecture. It knows the FastAPI + PostgreSQL backend, Next.js + TypeScript + Tailwind frontend, Docker/Ollama deployment setup, and is biased toward app.findmyjobs.app
19 Pulls 1 Tag Updated 4 months ago