-
SimonPu/llama-3-taiwan-8b-instruct-dpo
Llama-3-Taiwan-8B-Instruct-DPO is a large language model finetuned for Traditional Mandarin and English users. It has strong capabilities in language understanding, generation, reasoning, and multi-turn dialogue.
8B62 Pulls 2 Tags Updated 3 months ago
-
coderindajungle/medllama3
8B62 Pulls 1 Tag Updated 3 months ago
-
skratos115/qwen2-7b-opendevin-q4_k_m
Qwen2-7B-Instruct with OpenDevin Tool Calling
7B62 Pulls 1 Tag Updated 3 months ago
-
shadow2/gamma2
Gamma2 AI for Candy
7B62 Pulls 1 Tag Updated 5 months ago
-
wangrongsheng/aurora
š³ Aurora represents the Chinese version of the MoE model, refined from the Mixtral-8x7B architecture. It adeptly unlocks the modelās potential for bilingual dialogue in both Chinese and English across a wide range of open-domain topics.
8x7B62 Pulls 1 Tag Updated 7 months ago
-
f0rodo/peft
7B62 Pulls 1 Tag Updated 13 months ago
-
unclemusclez/unsloth-qwen2.5-coder
Qwen 2.5 Coder with Unsloth
1.5B 7B61 Pulls 63 Tags Updated 4 days ago
-
ALIENTELLIGENCE/psychologistv2
Psychologist
Tools 8B61 Pulls 1 Tag Updated 2 months ago
-
rjmalagon/dolphin-2.9.3-mistral
Tools 7B61 Pulls 2 Tags Updated 2 months ago
-
AIVA/gemma2-9b-it
9B61 Pulls 1 Tag Updated 3 months ago
-
adrienbrault/firefunction-v2
MaziyarPanahi/firefunction-v2-GGUF
61 Pulls 3 Tags Updated 3 months ago
-
aminadaven/dictalm2.0
7B61 Pulls 3 Tags Updated 5 months ago
-
vanilj/llama-3-magenta-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
61 Pulls 1 Tag Updated 5 months ago
-
galatolo/cerbero-7b-openchat
Cerbero-7b is the first 100% Free and Open Source Italian Large Language Model (LLM) ready to be used for research or commercial applications.
7B61 Pulls 1 Tag Updated 5 months ago
-
zw66/llama3-8b-gguf
Embedding61 Pulls 1 Tag Updated 5 months ago
-
vanilj/una-simplesmaug-34b-v1beta
UNA SimpleSmaug 34b v1beta Q4_K_M GGUF
34B61 Pulls 1 Tag Updated 6 months ago
-
markliou/tw-llama2
Model from Taiwan-Llama gguf
13B61 Pulls 2 Tags Updated 9 months ago
-
ALIENTELLIGENCE/uxui
UX/UI AI Developer
8B60 Pulls 1 Tag Updated 2 months ago
-
thash/darkidol-llama3-8b-v2
8B60 Pulls 1 Tag Updated 3 months ago
-
lstep/una-thepitbull-21.4b-v2
60 Pulls 1 Tag Updated 4 months ago
-
richardyoung/openbiollm
openbiollm
8B60 Pulls 1 Tag Updated 5 months ago
-
djkazic/llama-3-memgpt
8B60 Pulls 1 Tag Updated 5 months ago
-
damasak/codellama-chat-13b-chinese
13B60 Pulls 1 Tag Updated 7 months ago
-
stuehieyr/medleymd
13B parameter model, Mixture of Experts of 2 Mistral Fine Tunes, one of them expert in clinical domain.
13B60 Pulls 1 Tag Updated 9 months ago
-
themhv/neuralhermes
Original model: https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B
7B60 Pulls 1 Tag Updated 7 months ago