A large language model that can use text prompts to generate and discuss code.
4.9M Pulls 199 Tags Updated 1 year ago
The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.
147.2K Pulls 6 Tags Updated 4 months ago
Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.
993K Pulls 4 Tags Updated 1 month ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
859.1K Pulls 5 Tags Updated 1 year ago
Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion parameters.
785.3K Pulls 67 Tags Updated 1 year ago
A compact, yet powerful 10.7B large language model designed for single-turn conversation.
736.5K Pulls 32 Tags Updated 2 years ago
2,771 Pulls 2 Tags Updated 4 days ago
2,070 Pulls 1 Tag Updated 6 days ago
Qwen 3.5 distilled from Claude Opus 4.6
1,153 Pulls 5 Tags Updated 2 days ago
85 Pulls 1 Tag Updated 2 days ago
43 Pulls 1 Tag Updated 2 days ago
Qwen3 (8B) ultra distilled from Claude, GPT, and Gemini.
33 Pulls 2 Tags Updated yesterday
27 Pulls 1 Tag Updated 3 days ago
3 Pulls 1 Tag Updated 2 days ago
26.3.7. Qwen3.5-27B-Q4 Update: This model introduces higher-quality reasoning trajectories across domains such as science, instruction-following, and mathematics.
7,759 Pulls 1 Tag Updated 3 weeks ago
9B coding agent based on Qwen3.5-9B, fine-tuned on 425K real agentic traces from Claude Opus 4.6, GPT-5.4, and Gemini 3.1. Reads before it writes, traces bugs to the root cause, doesn't clobber your existing code.
6,664 Pulls 3 Tags Updated 2 weeks ago
Q8_0 Coder Claude-4.6-Opus
5,636 Pulls 5 Tags Updated 2 weeks ago
26.3.18. Ver.2 update: This iteration is powered by 14,000+ premium Claude 4.6 Opus-style general reasoning samples, with a major focus on achieving massive gains in reasoning efficiency while actively improving peak accuracy.
3,995 Pulls 1 Tag Updated 2 weeks ago
Qwen3.5-Claude-4.6-Opus-Reasoning-Distilled-v2; https://huggingface.co/Jackrong/; has vision properly merged and efficiently quantified.
2,949 Pulls 4 Tags Updated 1 week ago
Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled is a highly capable reasoning model fine-tuned on top of the powerful Qwen3.5 architecture. The model's core directive is to leverage state-of-the-art Chain-of-Thought (CoT) distillation primarily source
2,148 Pulls 1 Tag Updated 1 week ago