Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.
304.7K Pulls 6 Tags Updated 1 month ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
864.6K Pulls 3 Tags Updated 9 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
109.6M Pulls 93 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.6M Pulls 5 Tags Updated 1 year ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
929.1K Pulls 53 Tags Updated 1 year ago
OpenCoder is an open and reproducible code LLM family which includes 1.5B and 8B models, supporting chat in English and Chinese languages.
376.6K Pulls 9 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
323.9K Pulls 33 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
324.9K Pulls 33 Tags Updated 1 year ago
This model extends LLama-3 8B's context length from 8k to over 1m tokens.
313.4K Pulls 35 Tags Updated 1 year ago
Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
132.9K Pulls 13 Tags Updated 1 year ago
The IBM Granite Guardian 3.0 2B and 8B models are designed to detect risks in prompts and/or responses.
120.4K Pulls 10 Tags Updated 1 year ago
deepseek-v3-0324-Quants. - Q2_K is the lowest here - quantized = round((original - zero_point) / scale)
1,056 Pulls 1 Tag Updated 10 months ago
Model Name: DeepTerminal R1 Developed By: 8 Bit Labs, Model Type: Hybrid AI-Zero-Knowledge Proof Framework Framework: Solana Blockchain + DeepSeek AI + Recursive ZK Proofs License: Apache 2.0
9 Pulls 1 Tag Updated 7 months ago
Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excels at coding tasks. Created by Eric Hartford.
964.2K Pulls 70 Tags Updated 1 year ago
Personal hot tub technician bot
40 Pulls 1 Tag Updated 1 year ago
Optimized 8B (qwen3-vl:8b) for OpenClaw agents. Precise JSON tool calls, <thinking> reasoning, temp 0.5, 16k ctx. Runs smoothly on 8GB VRAM laptops with minimal hallucinations.
236 Pulls 1 Tag Updated 2 days ago
A SLERP merge of Qwen3 combining instruction-following with creative writing capabilities. Models: Qwen/Qwen3-8B for Strong instruction following and reasoning + allura-org/remnant-qwen3-8b for Enhanced creative writing
41 Pulls 1 Tag Updated 6 days ago
The **Llama-3.1-8B-Instruct-STO-Master** is a high-performance fine-tune of Meta's Llama-3.1-8B-Instruct. This model represents the "Master Version" (Model E) of an extensive research project aimed at pushing the boundaries of 8B parameter architectures.
34 Pulls 1 Tag Updated 6 days ago
64 Pulls 12 Tags Updated 2 days ago
Foundation-Sec-8B Ported to Ollama Format (unchanged)
82 Pulls 1 Tag Updated 3 weeks ago