MichelRosselli/GLM-4.6-REAP-268B-A32B:Q8_0
61 Downloads · Updated 2 weeks ago
GLM-4.6-REAP-268B-A32B (by Cerebras), a memory-efficient compressed variant of GLM-4.6 that maintains near-identical performance while being 25% lighter.
tools
thinking
4232aa9ff0de · 286GB
model
arch glm4moe · parameters 269B · quantization Q8_0
286GB
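The listed 286GB is consistent with the parameter count and quantization shown above. A rough check, assuming the standard ggml Q8_0 layout (blocks of 32 int8 weights plus one 16-bit scale, i.e. 34 bytes per 32 weights), that sizes are reported in decimal gigabytes, and that every tensor is stored as Q8_0 (in practice a few small tensors are typically kept at higher precision, and 269B is itself rounded); the function name is hypothetical:

```python
# Rough size estimate for a Q8_0 model file.
# Assumption: ggml Q8_0 blocks hold 32 int8 weights plus one fp16 scale.
BLOCK_WEIGHTS = 32
BLOCK_BYTES = 32 * 1 + 2  # 32 one-byte weights + 2-byte scale = 34 bytes


def q8_0_size_gb(n_params: float) -> float:
    """Estimated size in decimal GB if every weight were stored as Q8_0."""
    bytes_per_weight = BLOCK_BYTES / BLOCK_WEIGHTS  # 1.0625 bytes = 8.5 bits
    return n_params * bytes_per_weight / 1e9


estimate = q8_0_size_gb(269e9)  # parameter count from the model card
print(f"{estimate:.1f} GB")  # prints "285.8 GB"
```

The estimate lands within about 0.2GB of the listed size, which suggests nearly all weights are stored at 8.5 bits each.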
template
[gMASK]<sop> {{- if .Tools }}<|system|> # Tools You may call one or more functions to assist with th
1.8kB
params
{ "stop": [ "<|system|>", "<|user|>", "<|assistant|>" ] }
81B
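The stop list in params ends generation as soon as the model begins emitting another role marker, so a reply does not run on into a new turn. The sketch below only illustrates that effect on a finished string; it is not Ollama's implementation, and `truncate_at_stop` is a hypothetical helper name:

```python
# Illustration of what the "stop" parameter achieves: output is cut at the
# first occurrence of any stop sequence. This is a simplified sketch,
# not how the runtime applies stops during streaming.
STOP = ["<|system|>", "<|user|>", "<|assistant|>"]  # values from params


def truncate_at_stop(text: str, stops=STOP) -> str:
    """Return text up to (not including) the earliest stop sequence."""
    cut = len(text)
    for marker in stops:
        idx = text.find(marker)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


raw = "Paris is the capital of France.<|user|>And Germany?"
print(truncate_at_stop(raw))  # prints "Paris is the capital of France."
```

Text containing none of the markers is returned unchanged.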
Readme
No readme