Hi. RDson@🤗
-
mistral-nemo-12b-celeste-v1.9
898 Pulls 6 Tags Updated 3 months ago
-
midnight-miqu-70b-v1.5
Midnight-Miqu-70B-v1.5-GGUF Q4_K_S & Q4_K_M
806 Pulls 2 Tags Updated 7 months ago
-
supernova-medius
Arcee-SuperNova-Medius is a 14B parameter language model developed by Arcee.ai, built on the Qwen2.5-14B-Instruct architecture.
tools617 Pulls 22 Tags Updated 5 weeks ago
-
smaug-llama-3-70b-instruct
This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-70B-Instruct.
507 Pulls 11 Tags Updated 6 months ago
-
llama-3-8b-instruct-coder-v2
Llama-3-8B-Instruct-Coder-v2
365 Pulls 9 Tags Updated 6 months ago
-
llama-3.1-70b-instruct-lorablated-iq2_xs
tools325 Pulls 1 Tag Updated 3 months ago
-
gemma-2-ataraxy-9b
Made from Gemma 2 9B SPPO iter3 and SimPO
267 Pulls 19 Tags Updated 2 months ago
-
reflection-70b-iq2_xxs
Reflection Llama-3.1 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.
265 Pulls 1 Tag Updated 2 months ago
-
hermes-3-llama-3.1-8b
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.
tools264 Pulls 5 Tags Updated 3 months ago
-
theia-21b-v1
An upscaled NeMo with half its layers trained
229 Pulls 7 Tags Updated 3 months ago
-
llama-3-8b-instruct-32k-v0.1
Llama 3 8b 32k
189 Pulls 11 Tags Updated 7 months ago
-
calme-2.4-rys-78b
This model is a fine-tuned version of the dnhkng/RYS-XLarge, pushing the boundaries of natural language understanding and generation even further.
tools149 Pulls 2 Tags Updated 2 months ago
-
einstein-v6.1-llama3-8b
Weyaxi/Einstein-v6.1-Llama3-8B
124 Pulls 11 Tags Updated 6 months ago
-
tess-v2.5-qwen2-72b
Tess-v2.5 (Qwen2-72B) was fine-tuned over the newly released Qwen2-72B base, using the Tess-v2.5 dataset that contain 300K samples spanning multiple topics.
122 Pulls 3 Tags Updated 5 months ago
-
llama3.1-70b-iquants
Llama 3.1 70b IQs: IQ1_M, IQ2_M, IQ2_S, IQ2_XS, IQ2_XXS, IQ3_XS, IQ4_XS
tools107 Pulls 8 Tags Updated 3 months ago
-
llama-3-peach-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
104 Pulls 2 Tags Updated 6 months ago
-
palmyra-fin-70b-32k
Palmyra-Fin-70B-32K is a model built by Writer specifically to meet the needs of the financial industry. It is a leading LLM on financial benchmarks, outperforming other large language models in various financial tasks and evaluations.
95 Pulls 7 Tags Updated 3 months ago
-
mistral-nemo-gutenberg-12b-v2
axolotl-ai-co/romulus-mistral-nemo-12b-simpo finetuned on jondurbin/gutenberg-dpo-v0.1
tools76 Pulls 1 Tag Updated 2 months ago
-
orca-llama-3-8b-instruct
Orca-Llama-3-8B-Instruct-DPO
71 Pulls 2 Tags Updated 7 months ago
-
una-simplesmaug-34b-v1beta
UNA SimpleSmaug 34b v1beta Q4_K_M GGUF
67 Pulls 1 Tag Updated 7 months ago
-
llama-3-magenta-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
61 Pulls 1 Tag Updated 6 months ago
-
llama-3-14b-instruct-v1
Self-merge Llama 3 14B Instruct
60 Pulls 2 Tags Updated 7 months ago
-
qwen2.5-coder-32b-instruct-iq4_xs
Perfect size for 24GB GPUs!
tools58 Pulls 1 Tag Updated 9 days ago
-
command-r-08-2024-q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
43 Pulls 1 Tag Updated 2 months ago
-
mixtral_34bx2_moe_60b
Mixtral_34Bx2_MoE_60B GGUF Q4_K_M
39 Pulls 1 Tag Updated 7 months ago
-
cathallama-70b-i1-iq2_s
Perfect for 24GB cards
tools34 Pulls 1 Tag Updated 3 months ago
-
qwen2.5-32b-instruct_iq4_xs
Qwen2.5 32B Instruct IQ4_XS
tools27 Pulls 1 Tag Updated 6 weeks ago
-
llama-3.1-instruct-bellman-8b-swedish
This version of bellman is finetuned from llama-3.1-instruct-8b. It's finetuned for prompt question answering, based on a dataset created from Swedish wikipedia, with a lot of Sweden-centric questions.
tools25 Pulls 3 Tags Updated 6 weeks ago
-
trinity-2-codestral-22b-v0.2
Trinity is a coding specific Large Language Model series created by Migel Tissera.
24 Pulls 2 Tags Updated 2 months ago
-
rys-xlarge-iq3_xs
This is a new kind of model optimization. This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which was tuned from Qwen2-72B.
22 Pulls 1 Tag Updated 2 months ago
-
mistral-large-instruct-2407-iq3_xx
tools21 Pulls 1 Tag Updated 2 months ago
-
qwen2.5-72b-instruct-iq3_xxs
Qwen2.5 is the latest series of Qwen large language models.
tools14 Pulls 1 Tag Updated 2 months ago
-
qwen2.5-14b-instruct-iq4_xs
Qwen2.5 14B Instruct IQ4_XS
tools13 Pulls 1 Tag Updated 6 weeks ago
-
athene-v2-chat-iq3_xs
Athene-V2-Chat-IQ3_XS
tools7 Pulls 1 Tag Updated 6 days ago
-
command-r-08-2024:q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.