Hi. RDson@🤗
-
Phi-4
Microsoft's Phi 4 model
6,300 Pulls 5 Tags Updated 6 days ago
-
mistral-nemo-12b-celeste-v1.9
1,040 Pulls 6 Tags Updated 4 months ago
-
midnight-miqu-70b-v1.5
Midnight-Miqu-70B-v1.5-GGUF Q4_K_S & Q4_K_M
861 Pulls 2 Tags Updated 8 months ago
-
supernova-medius
Arcee-SuperNova-Medius is a 14B parameter language model developed by Arcee.ai, built on the Qwen2.5-14B-Instruct architecture.
tools850 Pulls 22 Tags Updated 2 months ago
-
smaug-llama-3-70b-instruct
This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-70B-Instruct.
518 Pulls 11 Tags Updated 7 months ago
-
llama-3-8b-instruct-coder-v2
Llama-3-8B-Instruct-Coder-v2
375 Pulls 9 Tags Updated 7 months ago
-
llama-3.1-70b-instruct-lorablated-iq2_xstools
343 Pulls 1 Tag Updated 4 months ago
-
gemma-2-ataraxy-9b
Made from Gemma 2 9B SPPO iter3 and SimPO
321 Pulls 19 Tags Updated 3 months ago
-
reflection-70b-iq2_xxs
Reflection Llama-3.1 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.
284 Pulls 1 Tag Updated 3 months ago
-
hermes-3-llama-3.1-8b
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.
tools272 Pulls 5 Tags Updated 4 months ago
-
theia-21b-v1
An upscaled NeMo with half its layers trained
238 Pulls 7 Tags Updated 4 months ago
-
llama-3-8b-instruct-32k-v0.1
Llama 3 8b 32k
190 Pulls 11 Tags Updated 8 months ago
-
calme-2.4-rys-78b
This model is a fine-tuned version of the dnhkng/RYS-XLarge, pushing the boundaries of natural language understanding and generation even further.
tools181 Pulls 2 Tags Updated 3 months ago
-
tess-v2.5-qwen2-72b
Tess-v2.5 (Qwen2-72B) was fine-tuned over the newly released Qwen2-72B base, using the Tess-v2.5 dataset that contain 300K samples spanning multiple topics.
135 Pulls 3 Tags Updated 6 months ago
-
einstein-v6.1-llama3-8b
Weyaxi/Einstein-v6.1-Llama3-8B
127 Pulls 11 Tags Updated 7 months ago
-
qwen2.5-coder-32b-instruct-iq4_xs
Perfect size for 24GB GPUs!
tools125 Pulls 1 Tag Updated 5 weeks ago
-
llama3.1-70b-iquants
Llama 3.1 70b IQs: IQ1_M, IQ2_M, IQ2_S, IQ2_XS, IQ2_XXS, IQ3_XS, IQ4_XS
tools119 Pulls 8 Tags Updated 4 months ago
-
palmyra-fin-70b-32k
Palmyra-Fin-70B-32K is a model built by Writer specifically to meet the needs of the financial industry. It is a leading LLM on financial benchmarks, outperforming other large language models in various financial tasks and evaluations.
116 Pulls 7 Tags Updated 4 months ago
-
llama-3-peach-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
106 Pulls 2 Tags Updated 7 months ago
-
mistral-nemo-gutenberg-12b-v2
axolotl-ai-co/romulus-mistral-nemo-12b-simpo finetuned on jondurbin/gutenberg-dpo-v0.1
tools94 Pulls 1 Tag Updated 3 months ago
-
orca-llama-3-8b-instruct
Orca-Llama-3-8B-Instruct-DPO
74 Pulls 2 Tags Updated 8 months ago
-
una-simplesmaug-34b-v1beta
UNA SimpleSmaug 34b v1beta Q4_K_M GGUF
69 Pulls 1 Tag Updated 8 months ago
-
llama-3-14b-instruct-v1
Self-merge Llama 3 14B Instruct
62 Pulls 2 Tags Updated 8 months ago
-
llama-3-magenta-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
61 Pulls 1 Tag Updated 7 months ago
-
qwen2.5-32b-instruct_iq4_xs
Qwen2.5 32B Instruct IQ4_XS
tools49 Pulls 1 Tag Updated 2 months ago
-
command-r-08-2024-q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
48 Pulls 1 Tag Updated 3 months ago
-
llama-3.1-instruct-bellman-8b-swedish
This version of bellman is finetuned from llama-3.1-instruct-8b. It's finetuned for prompt question answering, based on a dataset created from Swedish wikipedia, with a lot of Sweden-centric questions.
tools40 Pulls 3 Tags Updated 2 months ago
-
athene-v2-chat-iq3_xs
Athene-V2-Chat-IQ3_XS
tools40 Pulls 1 Tag Updated 5 weeks ago
-
mixtral_34bx2_moe_60b
Mixtral_34Bx2_MoE_60B GGUF Q4_K_M
39 Pulls 1 Tag Updated 8 months ago
-
cathallama-70b-i1-iq2_s
Perfect for 24GB cards
tools34 Pulls 1 Tag Updated 4 months ago
-
trinity-2-codestral-22b-v0.2
Trinity is a coding specific Large Language Model series created by Migel Tissera.
25 Pulls 2 Tags Updated 3 months ago
-
rys-xlarge-iq3_xs
This is a new kind of model optimization. This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which was tuned from Qwen2-72B.
22 Pulls 1 Tag Updated 3 months ago
-
mistral-large-instruct-2407-iq3_xxtools
21 Pulls 1 Tag Updated 3 months ago
-
qwen2.5-14b-instruct-iq4_xs
Qwen2.5 14B Instruct IQ4_XS
tools19 Pulls 1 Tag Updated 2 months ago
-
qwen2.5-72b-instruct-iq3_xxs
Qwen2.5 is the latest series of Qwen large language models.
tools17 Pulls 1 Tag Updated 3 months ago
-
command-r-08-2024:q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.