23.3K pulls · updated 2 months ago

Specialized uncensored/abliterated quants for the new OpenAI 20B MoE (Mixture of Experts) model, running at 80+ tokens/s (quantized Q5_1).

Tag: thinking 20b · dfee3e8688f3 · 16GB
Model: gpt-oss · 20.9B · Q5_1
Parameters: { "temperature": 1 }
System prompt (truncated): <|start|>system<|message|>You are ChatGPT, a large language model trained by OpenAI. Knowledge cutof
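
The default sampling parameter above corresponds to a `PARAMETER` line in an Ollama Modelfile. A minimal sketch of how such a build could be assembled locally; the `FROM` path is a placeholder, not the actual source of this quant:

```
# Hypothetical Modelfile sketch (the GGUF filename is a placeholder)
FROM ./gpt-oss-20b-abliterated-q5_1.gguf
PARAMETER temperature 1
```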

Readme

Based on https://huggingface.co/DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf

| Feature | Value |
| --- | --- |
| vision | false |
| thinking | true (cannot be switched off) |
| tools | not working |
| Device | Speed, tokens/s | Context | VRAM, GB | Version |
| --- | --- | --- | --- | --- |
| RTX 3090 24GB | ~143 | 8192 | 16 | Q5_1, 0.12.2 |
| RTX 3090 24GB | ~136 | 16384 | 16 | Q5_1, 0.12.2 |
| M1 Max 32GB | ~60 | 8192 | 16 | Q5_1, 0.12.2 |
| M1 Max 32GB | ~60 | 16384 | 16 | Q5_1, 0.12.2 |
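
As a sanity check on the ~16 GB figure: in llama.cpp, a Q5_1 block packs 32 weights as 32×5 bits of quants plus two fp16 values (scale and minimum), i.e. 24 bytes per 32 weights, or 6 bits per weight. A back-of-envelope sketch, using only the 20.9B parameter count from the listing above:

```python
# Back-of-envelope weight-storage estimate for a Q5_1 quantized model.
# Q5_1 block: 32 weights -> 32*5 bits of quants + 2-byte fp16 scale + 2-byte fp16 min.
bytes_per_block = 32 * 5 // 8 + 2 + 2        # 24 bytes per block of 32 weights
bits_per_weight = bytes_per_block * 8 / 32   # 6.0 bits/weight

params = 20.9e9                              # parameter count from the listing
size_gb = params * bits_per_weight / 8 / 1e9

print(f"{bits_per_weight} bits/weight -> ~{size_gb:.1f} GB")
# ~15.7 GB of raw weight data, consistent with the ~16 GB file size above
```

The small remainder up to 16 GB is metadata and any tensors kept at higher precision; the estimate ignores both.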