18 3 weeks ago

A bitnet-style LLM that runs on consumer hardware

21b
ollama run reecdev/bytenet:21b

Details

3 weeks ago

1c4fbe057bc0 · 4.5GB ·

ernie4_5-moe
·
21.8B
·
IQ1_S
{{ if .System }}<|begin_of_sentence|>{{ .System }}{{ end }} {{ if .Prompt }}User: {{ .Prompt }}{{ en
You are ByteNet, a 1.6-bit 21 billion parameter Large Language Model.
{ "num_ctx": 131072, "repeat_penalty": 1.1, "stop": [ "<|begin_of_sentence|>",

Readme

ByteNet

ByteNet is a 21 billion parameter LLM based on ERNIE 4.5 that can run on consumer GPUs with less than 6 GB of VRAM using aggressive compression techniques.

Quick Start

Pull ByteNet and run on GPU (min vram: 5 GB)

ollama run reecdev/bytenet