mannix

llama3.1-8b-abliterated

Abliterated v3 llama-3.1 8b with uncensored prompt

tools

52.8K Pulls 38 Tags Updated 1 year ago

gemma2-9b-simpo

Fine-tuned google/gemma-2-9b-it on princeton-nlp/gemma2-ultrafeedback-armorm with the SimPO objective.

11.4K Pulls 24 Tags Updated 2 years ago

llama3-uncensored

llama3-8b with uncensored GuruBot prompt

9,742 Pulls 1 Tag Updated 2 years ago

llamax3-8b-alpaca

LLaMAX is a multilingual language model, developed through continued pre-training on Llama3, and supports over 100 languages

9,165 Pulls 16 Tags Updated 2 years ago

llama3.1-8b-lexi

This is an uncensored version of Llama 3.1 8B Instruct with an uncensored prompt.

tools

8,035 Pulls 45 Tags Updated 1 year ago

defog-llama3-sqlcoder-8b

A capable language model for text-to-SQL generation for Postgres, Redshift and Snowflake that is on par with the most capable generalist frontier models.

7,815 Pulls 16 Tags Updated 2 years ago

deepseek-coder-v2-lite-instruct

An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

7,500 Pulls 23 Tags Updated 1 year ago

llama3-8b-ablitered-v3

Abliterated v3 llama-3 8b with uncensored prompt

4,672 Pulls 25 Tags Updated 2 years ago

qwen2-57b

Mixture-of-Experts model 57b

3,933 Pulls 18 Tags Updated 1 year ago

hermes-3-llama-3.1-8b

Hermes 3 Llama-3.1 8b Model by NousResearch

tools

3,109 Pulls 22 Tags Updated 1 year ago

dolphin-2.9-llama3-8b

The uncensored Dolphin model based on Mistral that excels at coding tasks. Updated to version 2.9 with llama 3.

2,551 Pulls 2 Tags Updated 2 years ago

llava-phi3

A new small LLaVA model fine-tuned from Phi 3 Mini [I-Quants]

vision

2,026 Pulls 4 Tags Updated 2 years ago

jan-nano

Jan-Nano is a compact 4-billion parameter language model specifically designed and trained for deep research tasks.

tools thinking

1,990 Pulls 4 Tags Updated 1 year ago

qwq-32b-abilterated

QwQ is an experimental research model focused on advancing AI reasoning capabilities. Abliterated with uncensored prompt, i-matrix quants.

tools

1,901 Pulls 17 Tags Updated 1 year ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing. I-Quant models.

tools

1,806 Pulls 47 Tags Updated 1 year ago

phi3-mini-4k

Phi-3 Mini is a lightweight 3B state-of-the-art open model by Microsoft. Updated in July 2024.

1,523 Pulls 19 Tags Updated 2 years ago

dolphin-2.9.2-qwen2-72b

This model is based on Qwen2-72b. Dolphin-2.9.2 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling. Dolphin is uncensored.

1,495 Pulls 20 Tags Updated 2 years ago

gemma4-98e-v7-coder

A further improved version of the Gemma-4 98e coder variant, the best 20b coder

vision tools thinking

1,111 Pulls 27 Tags Updated 2 days ago

gemma2-9b-sppo-iter3

This model was developed using Self-Play Preference Optimization at iteration 3, with google/gemma-2-9b-it as the starting point.

1,104 Pulls 24 Tags Updated 2 years ago

gemma4-98e-v6-coder

The best 20b coding model just got better! Beats the bigger 26b brother in Python and code reasoning

vision tools thinking

1,098 Pulls 62 Tags Updated 2 days ago

smallthinker-abliterated

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model. I-Quants models, abliterated with uncensored prompt.

888 Pulls 15 Tags Updated 1 year ago

qwen3.6-27b-a3b-coder

Pruned to 184e version of 35b-a3b with LCB and MultiPL-E HE targeting for Coding

vision

864 Pulls 47 Tags Updated 2 weeks ago

gemma2-9b

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models

834 Pulls 24 Tags Updated 2 years ago

dolphin-mixtral

Uncensored 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excel at coding tasks. Created by Eric Hartford.

800 Pulls 2 Tags Updated 2 years ago

gemma4-98e-v7-coderx

Gemma-4 98e coder max variant, top notch coding skills at the expense of science knowledge

vision tools thinking

712 Pulls 27 Tags Updated 2 days ago

omnimerge-v4-mtp

Qwen/Qwen3.6-27B + 3 Qwen3.6 fine-tunes with MLP-passthrough surgery - MTP quants

704 Pulls 8 Tags Updated 2 months ago

qwen2-math-7b

Qwen2-Math is a series of specialized math language models built upon the Qwen2 LLMs

691 Pulls 21 Tags Updated 1 year ago

llama3.1-storm

Llama-3.1-Storm-8B outperforms both Llama-3.1-8B-Instruct and Hermes-3-Llama-3.1-8B!

tools

616 Pulls 13 Tags Updated 1 year ago

deepseek-v2-lite-instruct

A strong, economical, and efficient Mixture-of-Experts language model.

599 Pulls 8 Tags Updated 2 years ago

qwq-32b

QwQ is an experimental research model focused on advancing AI reasoning capabilities. i-matrix quantizations.

tools

588 Pulls 23 Tags Updated 1 year ago

llama3-groq-tool-8b

A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

tools

571 Pulls 18 Tags Updated 2 years ago

llama3.1-70b

New state-of-the-art model from Meta available in 8B, 70B and 405B sizes

tools

568 Pulls 34 Tags Updated 1 year ago

internlm2.5-20b

InternLM2.5 has open-sourced a 20 billion parameter base model and a chat model tailored for practical scenarios.

557 Pulls 14 Tags Updated 1 year ago

omnimerge-v4

Qwen/Qwen3.6-27B + 3 Qwen3.6 fine-tunes with MLP-passthrough surgery

555 Pulls 27 Tags Updated 2 months ago

llama3-12b

Meta-Llama-3-12B-Instruct is a depth upscaling merge of llama3-8b from M. Labonne

527 Pulls 22 Tags Updated 2 years ago

hermes-3-llama-3.1-70b

Hermes 3 Llama-3.1 70b Model by NousResearch

tools

460 Pulls 22 Tags Updated 1 year ago

gemma4-98e-v5-coder

Pruned to 98 experts gemma-4 a4b 26b v5-coder. Best 20b coder model overall

tools thinking

449 Pulls 31 Tags Updated 2 months ago

llama-3.3

New state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. I-Quants models.

tools

409 Pulls 9 Tags Updated 1 year ago

replete-coder-llama3-8b

Replete-Coder-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.

386 Pulls 10 Tags Updated 2 years ago

smaug-llama3-8b

This model was built using the Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-8B-Instruct.

361 Pulls 22 Tags Updated 2 years ago

mixtral_7bx2_moe

A high-quality Mixture of Experts (MoE) model with open weights by Mistral AI.

328 Pulls 11 Tags Updated 2 years ago

gemma2-2b

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models

324 Pulls 11 Tags Updated 2 years ago

llama3.1-8b

New state-of-the-art model from Meta available in 8B, 70B and 405B sizes.

tools

299 Pulls 41 Tags Updated 1 year ago

smaug-llama3-70b

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to Meta-Llama-3-70B-Instruct

295 Pulls 9 Tags Updated 2 years ago

qwen2-7b

Qwen2 is the new series of Qwen large language models

292 Pulls 4 Tags Updated 2 years ago

llama3-sppo-iter3

Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3

265 Pulls 21 Tags Updated 2 years ago

smaug-llama3-70b-32k

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to Meta-Llama-3-70B-Instruct, 32k context length

263 Pulls 4 Tags Updated 2 years ago

replete-adapted-llama3-8b

Replete-Adapted-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.

224 Pulls 17 Tags Updated 2 years ago

llama3-gradient

This model extends Llama-3 70B's context length from 8k to over 1m tokens. [I-Quants]

212 Pulls 4 Tags Updated 2 years ago

gemma4-31b-he1

Pruned/masked heads version of gemma4-31b

tools thinking

207 Pulls 26 Tags Updated 2 months ago

replete-coder-merged-8b

Replete-Coder-Merged-8b is a general purpose model that is specially trained in coding in over 100 coding languages

203 Pulls 21 Tags Updated 2 years ago

smaug-qwen2-72b

The latest in the Smaug series - a finetune of Qwen2-72B-Instruct

202 Pulls 21 Tags Updated 2 years ago

gemma4-98e-v4

Pruned to 98 experts gemma-4 a4b 26b v4

tools thinking

190 Pulls 30 Tags Updated 2 months ago

starling-lm-10.7b

Starling-LM-10.7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

161 Pulls 13 Tags Updated 2 years ago

eurus-2-7b-prime

Eurus-2-7B-PRIME is trained using PRIME (Process Reinforcement through IMplicit rEward) method, an open-source solution for online reinforcement learning (RL) with process rewards, to advance reasoning abilities of language models.

152 Pulls 23 Tags Updated 1 year ago

qwen2-math-1.5b

Qwen2-Math is a series of specialized math language models built upon the Qwen2 LLMs

149 Pulls 7 Tags Updated 1 year ago

nous-hermes2-solar-10.7b

The powerful Solar based model by Nous Research that excels at scientific discussion and coding tasks.

134 Pulls 7 Tags Updated 2 years ago

qwen2-1.5b

Qwen2 is the new series of Qwen large language models

129 Pulls 4 Tags Updated 2 years ago

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model. I-Quants models.

120 Pulls 14 Tags Updated 1 year ago

wizardlm2

State of the art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning and agent use cases. More quantizations.

108 Pulls 2 Tags Updated 2 years ago

discopop-zephyr-7b-gemma

A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets using DiscoPOP

96 Pulls 5 Tags Updated 2 years ago

qwen2-0.5b

Qwen2 is the new series of Qwen large language models

94 Pulls 7 Tags Updated 2 years ago

phi3-mini-cpo-simpo

Phi-3-mini-4K-instruct with CPO-SimPO

93 Pulls 16 Tags Updated 2 years ago

gemma4-98e

Pruned to 98 experts gemma-4 a4b 26b v3

tools thinking

79 Pulls 27 Tags Updated 2 months ago

alchemistcoder-7b

AlchemistCoder is a series of coding models by InternLM. Tuned from Llama 2.

26 Pulls 3 Tags Updated 2 years ago

llama3-8b-v0.9

MaziyarPanahi/Llama-3-8B-Instruct-v0.9

3 Pulls 2 Tags Updated 2 years ago

Crazy Overclocker, Amateur Coder, AI Pragmatist, Non-sense Lover. osync developer
