-
twwch/m3e-base
Embedding193 Pulls 1 Tag Updated 3 months ago
-
shaw/dmeta-embedding-zh-q4
The Q4_K_M quantized version of dmeta-embedding-zh.
Embedding193 Pulls 1 Tag Updated 5 months ago
-
conceptsintamil/tamil-llama-7b-instruct-v0.2
This based on GGUF model hosted in HF https://huggingface.co/abhinand/tamil-llama-7b-instruct-v0.2
7B192 Pulls 1 Tag Updated 7 months ago
-
vanilj/hermes-3-llama-3.1-8b
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.
Tools 8B191 Pulls 5 Tags Updated 4 weeks ago
-
xingyaow/codeact-agent-mistral
An LLM agent that deeply integrates with Python Interpreter.
7B190 Pulls 1 Tag Updated 5 months ago
-
a123/tiny-gpt
Tinyllama modified to be able to function like a gpt model
1B190 Pulls 1 Tag Updated 6 months ago
-
CognitiveComputations/dolphin-2.9.3-qwen2-1.5b
189 Pulls 13 Tags Updated 3 months ago
-
mannix/smaug-llama3-70b
This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to Meta-Llama-3-70B-Instruct
70B189 Pulls 9 Tags Updated 3 months ago
-
fit2cloud/llama3-chinese
8B189 Pulls 2 Tags Updated 4 months ago
-
cas/discolm-german-laser
from mayflowergmbh/DiscoLM_German_7b_v1-laser
7B189 Pulls 1 Tag Updated 7 months ago
-
summerwind/cyberagent-calm2
CyberAgentLM2 is a decoder-only language model pre-trained on the 1.3T tokens of publicly available Japanese and English datasets.
7B189 Pulls 27 Tags Updated 10 months ago
-
vanilj/theia-21b-v1
An upscaled NeMo with half its layers trained
188 Pulls 7 Tags Updated 5 weeks ago
-
BlackDream/blue-orchid-2x7b
Roleplaying focused MoE Mistral model.
13B188 Pulls 1 Tag Updated 6 weeks ago
-
cnmoro/gemma2-2b-it-abliterated
2B188 Pulls 2 Tags Updated 7 weeks ago
-
eramax/dolphin-2.9.2-qwen2-7b
https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-7b
7B188 Pulls 1 Tag Updated 3 months ago
-
reefer/her2
she back and shes dirtyer
8B188 Pulls 1 Tag Updated 3 months ago
-
rouge/wizardlm-2-7b-abliterated
This is the WizardLM-2-7B model with orthogonalized bfloat16 safetensor weights, based on the implementation by @failspy
7B187 Pulls 2 Tags Updated 3 months ago
-
mrhua/llama3-8b-chinese-lora-law_f16_q4_0
8B187 Pulls 1 Tag Updated 4 months ago
-
CognitiveComputations/dolphin-yi-1.5-32k
186 Pulls 10 Tags Updated 2 months ago
-
ashishpatel26/granite-8b-code
This is IBM Release opensource code model.
8B186 Pulls 1 Tag Updated 4 months ago
-
koesn/mistral-7b-instruct
Fixed num_ctx to 32768. This Mistral 7B v0.2 Instruct model is ready to use for full model's 32k contexts window.
7B186 Pulls 5 Tags Updated 4 months ago
-
vthebeast/mythalion-13b
mythalion-13b.Q5_K_M.gguf
13B186 Pulls 1 Tag Updated 6 months ago
-
kubernetes_bad/chargen-v2
CharGen v2 helps you to write characters for role playing
7B185 Pulls 20 Tags Updated 2 months ago
-
vanilj/llama-3-8b-instruct-32k-v0.1
Llama 3 8b 32k
8B185 Pulls 11 Tags Updated 4 weeks ago
-
mikelp/starchat2-15b
StarCoder2 - Code assistant (fine-tuned) 15b model quantized to fit 16gb VRAM
185 Pulls 1 Tag Updated 5 months ago