NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.
2.4M Pulls 7 Tags Updated 2 months ago
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
4.8M Pulls 17 Tags Updated 10 months ago
Stable LM 2 is a state-of-the-art 1.6B and 12B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
972.5K Pulls 84 Tags Updated 2 years ago
123B model that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.
231.7K Pulls 6 Tags Updated 5 months ago
gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built upon gpt-oss.
143K Pulls 3 Tags Updated 7 months ago
For testing
24 Pulls 1 Tag Updated 1 month ago
Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
392.2K Pulls 13 Tags Updated 1 year ago
An experimental 1.1B parameter model trained on the new Dolphin 2.8 dataset by Eric Hartford and based on TinyLlama.
693.2K Pulls 18 Tags Updated 2 years ago
Fully decensored Gemma 4 12B (abliterated with Heretic) — 0/100 genuine refusals at KL 0.0284, i.e. near-zero capability loss.
1,476 Pulls 3 Tags Updated 14 hours ago
29 Pulls 6 Tags Updated 44 minutes ago
Zion is a 12B parameter model with 2-bit quantisation, making it suitable for edge devices with minimal RAM. Its 1000K context makes it ideal for coding. The model needs about 5.5 GB of memory to run, and the download is just 4.8 GB.
10 Pulls 1 Tag Updated 4 days ago
An optimized version of Google's TranslateGemma-12B-it (Gemma 3) designed for high-fidelity translation. This build features hard-coded Temperature=0.1 and English Anchor support to eliminate output redundancy and maximize accuracy.
33.8K Pulls 1 Tag Updated 4 months ago
Blazingly fast chat model for conversations and tool use
1,570 Pulls 6 Tags Updated 1 month ago
LFM2.5 is a new family of hybrid models designed for on-device deployment. It builds on the LFM2 architecture with extended pre-training and reinforcement learning.
1,776 Pulls 4 Tags Updated 4 months ago
A Mistral Nemo 12B fine-tune that speaks in the voice of Oscar Wilde, the Anglo-Irish aesthete, playwright, and wit.
16 Pulls 1 Tag Updated 3 weeks ago
A Mistral Nemo 12B fine-tune that speaks in the voice of Marcus Aurelius, the Roman emperor and Stoic philosopher whose Meditations were composed in Greek on the Danubian frontier and never published in his lifetime.
13 Pulls 2 Tags Updated 3 weeks ago
LFM2.5 is a new family of hybrid models designed for on-device deployment.
423 Pulls 2 Tags Updated 2 months ago
397 Pulls 1 Tag Updated 2 months ago
MinerU 2.5 Pro (1.2B) - Q4_K_M: an advanced document-parsing vision-language model (VLM).
190 Pulls 1 Tag Updated 1 month ago
Source: https://huggingface.co/FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF?local-app=ollama
208 Pulls 1 Tag Updated 1 month ago