step-3 · Ollama

deepseek-v3

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

671b

3.8M Pulls 5 Tags Updated 1 year ago

llama3.3

New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

tools 70b

4M Pulls 14 Tags Updated 1 year ago

glm4

A strong multi-lingual general language model with competitive performance to Llama 3.

9b

1.1M Pulls 32 Tags Updated 1 year ago

laguna-xs.2

Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine.

tools thinking

16.5K Pulls 7 Tags Updated 1 month ago

nuextract

A 3.8B model fine-tuned on a private high-quality synthetic dataset for information extraction, based on Phi-3.

3.8b

509.3K Pulls 17 Tags Updated 1 year ago

llava-phi3

A new small LLaVA model fine-tuned from Phi 3 Mini.

vision 3.8b

288.8K Pulls 4 Tags Updated 2 years ago

pacozaa/tinyllama

https://huggingface.co/pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF

114 Pulls 1 Tag Updated 2 years ago

steelpuddles/hermes-4.3-36B

Nous Hermes 4.3 36B parameters with thinking and tools enabled

tools thinking

620 Pulls 2 Tags Updated 1 month ago

second_constantine/yandex-gpt-5-lite

Instruct version of the large language model YandexGPT 5 Lite with 8B parameters with a context length of 32k tokens. (quantised version of Q5_K_M)

8b

530 Pulls 3 Tags Updated 9 months ago

ScrambieBambie/MS3.2-PaintedFantasy-Visage-v3-34B

zerofata/MS3.2-PaintedFantasy-Visage-v3-34B

tools

147 Pulls 3 Tags Updated 9 months ago

ScrambieBambie/MS3.2-PaintedFantasy-Visage-v2-33B

zerofata/MS3.2-PaintedFantasy-Visage-v2-33B

tools

41 Pulls 2 Tags Updated 10 months ago

huihui_ai/deepseek-v3

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

7,186 Pulls 2 Tags Updated 1 year ago

huihui_ai/deepseek-v3-abliterated

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

671b

3,929 Pulls 5 Tags Updated 1 year ago

potatocannon11/doc-g

Doc G is a high-energy Environmental Science teacher persona for Llama 3.2 who is obsessed with efficiency and saving money. He delivers punchy science lessons mixed with lectures about wasting electricity.

tools

15 Pulls 1 Tag Updated 6 months ago

milkey/deepseek-v3-UD

(Unsloth Dynamic Quants) A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

2,063 Pulls 3 Tags Updated 1 year ago

ScrambieBambie/MS3.2-PaintedFantasy-Visage-33B

zerofata/MS3.2-PaintedFantasy-Visage-33B

tools

16 Pulls 1 Tag Updated 11 months ago

sm1sek/methgpt

MethGPT is fork of UnknownFish/waltergpt with updated system prompt and updated LLAMA (from 2 to 3)

155 Pulls 1 Tag Updated 1 year ago

org/deepseek-v3-fast

Single file version with (Dynamic Quants) A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

114 Pulls 4 Tags Updated 1 year ago

lucataco/deepseek-v3-64k

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

23 Pulls 1 Tag Updated 1 year ago

fixt/home-3b-v3

The "Home" model is a fine tuning of the StableLM-Zephyr-3B model. It achieves a score of 97.11% score for JSON function calling accuracy.

311K Pulls 7 Tags Updated 2 years ago