A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
3M Pulls 17 Tags Updated 4 months ago
Stable LM 2 is a state-of-the-art 1.6B and 12B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
157.3K Pulls 84 Tags Updated 1 year ago
123B model that excels at using tools to explore codebases, edit multiple files, and power software engineering agents.
8,149 Pulls 6 Tags Updated 5 days ago
gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built upon gpt-oss.
31.9K Pulls 3 Tags Updated 1 month ago
An experimental 1.1B parameter model trained on the new Dolphin 2.8 dataset by Eric Hartford and based on TinyLlama.
193.7K Pulls 18 Tags Updated 1 year ago
Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
56.7K Pulls 13 Tags Updated 1 year ago
inflatebot/MN-12B-Mag-Mell-R1
25.4K Pulls 2 Tags Updated 10 months ago
Dolphin, uncensored, trained on mistralai/Mistral-Nemo-Base-2407 (12B), 128k context.
6,590 Pulls 16 Tags Updated 1 year ago
Infermatic/MN-12B-Inferor-v0.0
3,726 Pulls 2 Tags Updated 10 months ago
An uncensored model derived from Mistral-Nemo (12B). It ranks in the top 3 on the UGI leaderboard for models of its size. It was created by "Yamata Zen" and quantized by "team mradermacher".
3,252 Pulls 1 Tag Updated 9 months ago
2,783 Pulls 1 Tag Updated 1 year ago
cognitivecomputations/dolphin-2.9.3-mistral-nemo-12b
2,755 Pulls 2 Tags Updated 1 year ago
nothingiisreal/MN-12B-Celeste-V1.9
2,702 Pulls 5 Tags Updated 10 months ago
2,330 Pulls 6 Tags Updated 1 year ago
Mag Mell is a merge of pre-trained language models created using mergekit, based on Mistral Nemo. It is a roleplay and storytelling model that combines the best parts of many other models into a general-purpose solution for many use cases.
2,041 Pulls 2 Tags Updated 11 months ago
Quantized versions of Gemma3-12B and Gemma3-27B, optimized for tool usage in Cline / Roo Code and complex problem solving.
1,924 Pulls 2 Tags Updated 7 months ago
TheDrummer/Rocinante-12B-v1.1
1,752 Pulls 2 Tags Updated 10 months ago
The EXAONE 4.0 model series consists of two sizes: a mid-size 32B model optimized for high performance, and a small-size 1.2B model designed for on-device applications.
1,722 Pulls 3 Tags Updated 4 months ago
This is from https://huggingface.co/TheDrummer/Rocinante-12B-v1.1-GGUF
1,710 Pulls 1 Tag Updated 1 year ago
25.3.12. Gemma3-12B-it(instruction) Q4_K_M (7.3GB)
1,705 Pulls 1 Tag Updated 9 months ago