18 3 weeks ago

A bitnet-style LLM that runs on consumer hardware

21b
ollama run reecdev/bytenet

Models

View all →

Readme

ByteNet

ByteNet is a 21 billion parameter LLM based on ERNIE 4.5 that can run on consumer GPUs with less than 6 GB of VRAM using aggressive compression techniques.

Quick Start

Pull ByteNet and run on GPU (min vram: 5 GB)

ollama run reecdev/bytenet