LFM2.5-8B-A1B, an edge model built for fast, reliable tool calling on consumer hardware.
32.8K Pulls 5 Tags Updated 3 weeks ago
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.
481K Pulls 6 Tags Updated 6 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
116.3M Pulls 93 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.9M Pulls 5 Tags Updated 1 year ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
1M Pulls 3 Tags Updated 1 year ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
2M Pulls 53 Tags Updated 2 years ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data. They demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
971.8K Pulls 33 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are designed to support tool-based use cases and retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
974.4K Pulls 33 Tags Updated 1 year ago
This model extends Llama-3 8B's context length from 8K to over 1M tokens.
952.9K Pulls 35 Tags Updated 2 years ago
OpenCoder is an open and reproducible code LLM family which includes 1.5B and 8B models, supporting chat in English and Chinese languages.
603K Pulls 9 Tags Updated 1 year ago
Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
393.4K Pulls 13 Tags Updated 1 year ago
The IBM Granite Guardian 3.0 2B and 8B models are designed to detect risks in prompts and/or responses.
319.9K Pulls 10 Tags Updated 1 year ago
4 Pulls 3 Tags Updated 2 days ago
3 Pulls 3 Tags Updated 2 days ago
2 Pulls 1 Tag Updated 4 days ago
Quants of deepseek-v3-0324. Q2_K is the lowest quantization level offered here. Quantization formula: quantized = round((original - zero_point) / scale)
1,131 Pulls 1 Tag Updated 1 year ago
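The formula quoted in the entry above, quantized = round((original - zero_point) / scale), can be illustrated with a minimal Python sketch. This is only a toy per-list affine quantizer under stated assumptions: the helper names (fit_params, quantize, dequantize) are hypothetical, zero_point is taken to be the minimum value, and scale is chosen so the range fits in the given bit width. It is not the actual Q2_K scheme, which works block-wise and is more involved.

```python
# Toy affine quantization sketch. All names here are hypothetical and the
# choice of zero_point = min(values) is an assumption, not the Q2_K format.

def fit_params(values, bits):
    """Pick scale and zero_point so values map onto 0 .. 2**bits - 1."""
    lo, hi = min(values), max(values)
    levels = 2 ** bits - 1
    # Avoid a zero scale when all values are identical.
    scale = (hi - lo) / levels if hi > lo else 1.0
    return scale, lo

def quantize(values, scale, zero_point):
    """Apply quantized = round((original - zero_point) / scale)."""
    return [round((v - zero_point) / scale) for v in values]

def dequantize(codes, scale, zero_point):
    """Approximate inverse: original ~= quantized * scale + zero_point."""
    return [q * scale + zero_point for q in codes]

if __name__ == "__main__":
    weights = [0.0, 0.5, 1.0, 1.5]
    scale, zero_point = fit_params(weights, bits=2)
    codes = quantize(weights, scale, zero_point)
    print(codes)
    print(dequantize(codes, scale, zero_point))
```

With 2 bits there are only four levels, so any weight that does not fall exactly on a level is rounded to the nearest one, which is why lower quantization levels trade accuracy for size.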
Model Name: DeepTerminal R1. Developed By: 8 Bit Labs. Model Type: Hybrid AI-Zero-Knowledge Proof Framework. Framework: Solana Blockchain + DeepSeek AI + Recursive ZK Proofs. License: Apache 2.0.
60 Pulls 1 Tag Updated 1 year ago
A 480B large model for generating anime-style images.
92 Pulls 1 Tag Updated 2 months ago
Uncensored 8x7b and 8x22b fine-tuned models, based on the Mixtral mixture of experts models, that excel at coding tasks. Created by Eric Hartford.
1.8M Pulls 70 Tags Updated 1 year ago
Personal hot tub technician bot
43 Pulls 1 Tag Updated 2 years ago