Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
MichelRosselli
/
GLM-4.6-REAP-268B-A32B
:Q6_K
61
Downloads
Updated
2 weeks ago
GLM-4.6-REAP-268B-A32B (by Cerebras), a memory-efficient compressed variant of GLM-4.6 that maintains near-identical performance while being 25% lighter.
GLM-4.6-REAP-268B-A32B (by Cerebras), a memory-efficient compressed variant of GLM-4.6 that maintains near-identical performance while being 25% lighter.
Cancel
tools
thinking
Updated 2 weeks ago
2 weeks ago
ce8b5e74ee10 · 221GB ·
model
arch
glm4moe
·
parameters
269B
·
quantization
Q6_K
221GB
template
[gMASK]<sop> {{- if .Tools }}<|system|> # Tools You may call one or more functions to assist with th
1.8kB
params
{ "stop": [ "<|system|>", "<|user|>", "<|assistant|>" ] }
81B
Readme
No readme
Write
Preview
Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)