-
mannix/smaug-qwen2-72b
The latest in the Smaug series - a finetune of Qwen2-72B-Instruct
72B100 Pulls 21 Tags Updated 3 months ago
-
mannix/replete-coder-merged-8b
Replete-Coder-Merged-8b is a general purpose model that is specially trained in coding in over 100 coding languages
8B146 Pulls 21 Tags Updated 3 months ago
-
mannix/replete-coder-llama3-8b
Replete-Coder-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.
210 Pulls 10 Tags Updated 3 months ago
-
mannix/replete-adapted-llama3-8b
Replete-Adapted-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.
8B142 Pulls 17 Tags Updated 3 months ago
-
akx/viking-7b
Viking 7B is a model pretrained on Finnish, English, Swedish, Danish, Norwegian, Icelandic and code.
8B394 Pulls 1 Tag Updated 5 months ago
-
mayflowergmbh/wiedervereinigung
This is a dpo aligned merge of our favourite german models, scoring 7.11 on the mt-bench-de average.
7B334 Pulls 1 Tag Updated 6 months ago
-
samge/parrot
Repeat what the user says like a parrot.
8B18 Pulls 2 Tags Updated 5 months ago
-
mannix/discopop-zephyr-7b-gemma
A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets using DiscoPOP
29 Pulls 5 Tags Updated 4 months ago
-
nqduc/mixsura
State-of-the-art MoE Large Language Model for Vietnamese
8x7B101 Pulls 19 Tags Updated 6 months ago
-
nqduc/mixsura-sft
State-of-the-art MoE Large Language Model for Vietnamese
8x7B50 Pulls 19 Tags Updated 6 months ago
-
schroneko/gemma-2-2b-jpn-it
Gemma-2-JPN is a Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language with the same level of performance of English only queries on Gemma 2.
2B99 Pulls 14 Tags Updated 10 days ago
-
mannix/llama3.1-70b
New state-of-the-art model from Meta available in 8B, 70B and 405B sizes
Tools 70B120 Pulls 34 Tags Updated 2 months ago
-
mannix/llama3.1-8b
New state-of-the-art model from Meta available in 8B, 70B and 405B sizes.
Tools 8B72 Pulls 41 Tags Updated 2 months ago
-
arcee-ai/arcee-spark
Arcee Spark is a powerful 7B parameter language model that punches well above its weight class.
256 Pulls 1 Tag Updated 3 months ago
-
datouxia/llama3-8b-chinese-chat-q8-v2
导入自https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit/
8B433 Pulls 1 Tag Updated 5 months ago
-
cas/wiedervereinigung-7b-dpo-laser
from mayflowergmbh/Wiedervereinigung-7b-dpo-laser-GGUF
7B160 Pulls 1 Tag Updated 8 months ago
-
vanilj/llama3.1-70b-iquants
Llama 3.1 70b IQs: IQ1_M, IQ2_M, IQ2_S, IQ2_XS, IQ2_XXS, IQ3_XS, IQ4_XS
Tools 70B85 Pulls 8 Tags Updated 2 months ago
-
lrs33/ragideal-chat
RAGIdeal-Chat LLM
14B5 Pulls 1 Tag Updated 4 days ago
-
Accessibles/catallama-v0.2-instruct-sft-dpo-merged
CataLlama-v0.2-Instruct-SFT-DPO-Merged is a merge between catallama/CataLlama-v0.2-Instruct-SFT and catallama/CataLlama-v0.2-Instruct-DPO
8B2 Pulls 1 Tag Updated 2 months ago
-
Luzivx/marsa-maroc-model
8B1 Pull 1 Tag Updated 2 months ago
-
Luzivx/luzivila-model
8B1 Tag Updated 2 months ago