Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
107.7M Pulls 93 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.5M Pulls 5 Tags Updated 11 months ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
773K Pulls 3 Tags Updated 8 months ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
637.8K Pulls 53 Tags Updated 1 year ago
OpenCoder is an open and reproducible code LLM family which includes 1.5B and 8B models, supporting chat in English and Chinese languages.
271.5K Pulls 9 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are designed to support tool-based use cases and retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
151.4K Pulls 33 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data; they demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
150.4K Pulls 33 Tags Updated 11 months ago
This model extends Llama-3 8B's context length from 8K to over 1M tokens.
141.4K Pulls 35 Tags Updated 1 year ago
The IBM Granite Guardian 3.0 2B and 8B models are designed to detect risks in prompts and/or responses.
61.5K Pulls 10 Tags Updated 1 year ago
Sailor2 is a family of multilingual language models made for South-East Asia, available in 1B, 8B, and 20B parameter sizes.
57.2K Pulls 13 Tags Updated 1 year ago
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.
9,911 Pulls 6 Tags Updated 6 days ago
deepseek-v3-0324-Quants. Q2_K is the lowest quantization level here. Quantization formula: quantized = round((original - zero_point) / scale)
1,041 Pulls 1 Tag Updated 8 months ago
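The formula quoted in the entry above, quantized = round((original - zero_point) / scale), can be illustrated with a minimal sketch. Everything beyond the formula itself is an assumption made for illustration: the function names, the choice of the minimum value as `zero_point`, and the 2-bit default. The actual Q2_K format is more involved (it works on blocks of weights), and this sketch does not reproduce it.

```python
def quantize(values, bits=2):
    """Map floats to integer codes in [0, 2**bits - 1] using the listed formula.

    Assumption: zero_point is the smallest input value and scale spreads the
    input range evenly over the available integer levels.
    """
    levels = (1 << bits) - 1
    zero_point = min(values)
    span = max(values) - zero_point
    # Guard against a zero range so the division below is always defined.
    scale = span / levels if span > 0 else 1.0
    codes = [round((v - zero_point) / scale) for v in values]
    return codes, scale, zero_point


def dequantize(codes, scale, zero_point):
    """Invert the formula: original is approximately code * scale + zero_point."""
    return [c * scale + zero_point for c in codes]


if __name__ == "__main__":
    weights = [-0.8, -0.1, 0.3, 1.2]
    codes, scale, zero_point = quantize(weights, bits=2)
    print(codes)
    print(dequantize(codes, scale, zero_point))
```

With only four levels at 2 bits, the reconstructed values can differ noticeably from the inputs, which is why the lowest quantization level trades the most accuracy for the smallest file size.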
Uncensored 8x7b and 8x22b fine-tuned models, based on the Mixtral mixture-of-experts models, that excel at coding tasks. Created by Eric Hartford.
774.8K Pulls 70 Tags Updated 12 months ago
Personal hot tub technician bot
39 Pulls 1 Tag Updated 1 year ago
NousResearch/Hermes-2-Theta-Llama-3-8B
588.6K Pulls 5 Tags Updated 1 year ago
Alibaba's text reranking model. Qwen3-Reranker-8B has the following features: Model Type: Text Reranking. Supported Languages: 100+ Languages. Number of Parameters: 8B. Context Length: 32k.
196.4K Pulls 5 Tags Updated 6 months ago
Kimina-Prover-Distill-8B is a theorem proving model developed by the Project Numina and Kimi teams, focusing on competition-style problem solving capabilities in Lean 4. It is a distillation of Kimina-Prover-72B.
118.2K Pulls 1 Tag Updated 5 months ago
NousResearch/Hermes-2-Pro-Llama-3-8B
89K Pulls 5 Tags Updated 1 year ago
Llama-3.1-Storm-8B outperforms both Llama-3.1-8B-Instruct and Hermes-3-Llama-3.1-8B! 🤗 Blog: https://huggingface.co/blog/akjindal53244/llama31-storm8b
37.4K Pulls 3 Tags Updated 1 year ago
Abliterated v3 Llama-3.1 8B with uncensored prompt
33.1K Pulls 38 Tags Updated 1 year ago