A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
3.8M Pulls 5 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
4M Pulls 14 Tags Updated 1 year ago
A strong multi-lingual general language model with competitive performance to Llama 3.
1.1M Pulls 32 Tags Updated 1 year ago
Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine.
16.5K Pulls 7 Tags Updated 1 month ago
A 3.8B model fine-tuned on a private high-quality synthetic dataset for information extraction, based on Phi-3.
509.3K Pulls 17 Tags Updated 1 year ago
A new small LLaVA model fine-tuned from Phi 3 Mini.
288.8K Pulls 4 Tags Updated 2 years ago
https://huggingface.co/pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF
114 Pulls 1 Tag Updated 2 years ago
Nous Hermes 4.3 36B parameters with thinking and tools enabled
620 Pulls 2 Tags Updated 1 month ago
Instruct version of the large language model YandexGPT 5 Lite with 8B parameters with a context length of 32k tokens. (quantised version of Q5_K_M)
530 Pulls 3 Tags Updated 9 months ago
zerofata/MS3.2-PaintedFantasy-Visage-v3-34B
147 Pulls 3 Tags Updated 9 months ago
zerofata/MS3.2-PaintedFantasy-Visage-v2-33B
41 Pulls 2 Tags Updated 10 months ago
7,186 Pulls 2 Tags Updated 1 year ago
3,929 Pulls 5 Tags Updated 1 year ago
Doc G is a high-energy Environmental Science teacher persona for Llama 3.2 who is obsessed with efficiency and saving money. He delivers punchy science lessons mixed with lectures about wasting electricity.
15 Pulls 1 Tag Updated 6 months ago
(Unsloth Dynamic Quants) A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
2,063 Pulls 3 Tags Updated 1 year ago
zerofata/MS3.2-PaintedFantasy-Visage-33B
16 Pulls 1 Tag Updated 11 months ago
MethGPT is fork of UnknownFish/waltergpt with updated system prompt and updated LLAMA (from 2 to 3)
155 Pulls 1 Tag Updated 1 year ago
Single file version with (Dynamic Quants) A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
114 Pulls 4 Tags Updated 1 year ago
23 Pulls 1 Tag Updated 1 year ago
The "Home" model is a fine tuning of the StableLM-Zephyr-3B model. It achieves a score of 97.11% score for JSON function calling accuracy.
311K Pulls 7 Tags Updated 2 years ago