dne · Ollama

dolphin3

Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.

8b

3.8M Pulls 5 Tags Updated 1 year ago

dolphin-llama3

Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.

8b 70b

2M Pulls 53 Tags Updated 2 years ago

devstral-small-2

24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

vision tools cloud 24b

865.8K Pulls 6 Tags Updated 6 months ago

devstral-2

123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

tools cloud 123b

237K Pulls 6 Tags Updated 6 months ago

deepseek-coder

DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.

1.3b 6.7b 33b

4.3M Pulls 102 Tags Updated 2 years ago

deepscaler

A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.

1.5b

1.2M Pulls 5 Tags Updated 1 year ago

dolphin-mixtral

Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excels at coding tasks. Created by Eric Hartford.

8x7b 8x22b

1.8M Pulls 70 Tags Updated 1 year ago

deepseek-v2.5

An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.

236b

277.3K Pulls 7 Tags Updated 1 year ago

dbrx

DBRX is an open, general-purpose LLM created by Databricks.

132b

311.4K Pulls 7 Tags Updated 2 years ago

deepseek-v4-pro

DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

tools thinking cloud

120.7K Pulls 1 Tag Updated 1 month ago

dhiltgen/qwen3-coder-next

tools

47 Pulls 4 Tags Updated 1 week ago

deepseek-v3.2

DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

tools thinking cloud

2.2M Pulls 1 Tag Updated 5 months ago

dhiltgen/qwen3-next

tools thinking 80b

11 Pulls 9 Tags Updated 1 week ago

deepseek-ocr

DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

vision 3b

469.7K Pulls 3 Tags Updated 6 months ago

deepseek-v3.1

DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

tools thinking cloud 671b

699.1K Pulls 8 Tags Updated 8 months ago

dhiltgen/gemma3

Testing new conversion patterns...

vision 270m 1b 4b 12b 27b

322 Pulls 36 Tags Updated 1 week ago

dhiltgen/gpt-oss

Testing new conversion patterns...

tools thinking 20b 120b

196 Pulls 3 Tags Updated 1 month ago

DedeProGames/andy-lite

The next generation of light models built to play Minecraft! Inspired by Andy-4

138 Pulls 2 Tags Updated 3 months ago

dhiltgen/nemotron3

vision tools thinking 33b

30 Pulls 6 Tags Updated 1 week ago

daudfarzand/myjobsqwen

MyJobs‑aware assistant model tuned for the MyJobs repo and architecture. It knows the FastAPI + PostgreSQL backend, Next.js + TypeScript + Tailwind frontend, Docker/Ollama deployment setup, and is biased toward app.findmyjobs.app

tools

19 Pulls 1 Tag Updated 4 months ago