146 3 months ago

This model is a mixed GGUF Q2_K_S quantization of Cerebras' GLM-4.6-REAP-218B-A32B-FP8, generated using Intel's AutoRound algorithm.

tools · thinking
ollama run MichelRosselli/GLM-4.6-REAP-218B-A32B-FP8-mixed-AutoRound:Q2_K_S
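
Besides the interactive `ollama run` command above, the model can be queried programmatically once it has been pulled. The following is a minimal sketch, assuming an Ollama server is listening on its default local address (`http://localhost:11434`) and using the `/api/chat` endpoint; the helper names `build_chat_request` and `chat` are hypothetical and not part of this model page.

```python
import json
import urllib.request

# Model tag copied from the run command on this page.
MODEL = "MichelRosselli/GLM-4.6-REAP-218B-A32B-FP8-mixed-AutoRound:Q2_K_S"

# Assumption: Ollama is running locally on its default port.
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def build_chat_request(prompt: str) -> dict:
    """Build the JSON payload for a single-turn, non-streaming chat call."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one complete JSON response
    }


def chat(prompt: str, url: str = OLLAMA_CHAT_URL) -> str:
    """Send the prompt to the server and return the assistant's reply text."""
    body = json.dumps(build_chat_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        reply = json.loads(response.read())
    return reply["message"]["content"]
```

Calling `chat("Hello")` would return the model's reply as a string, provided the server is running and the 72GB model fits on the host.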

Details

3 months ago

6b9803b4c7a2 · 72GB · glm4moe · 218B · Q2_K_S
[gMASK]<sop> {{- if .Tools }}<|system|> # Tools You may call one or more functions to assist with th
{ "stop": [ "<|system|>", "<|user|>", "<|assistant|>" ] }
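
The `stop` parameter above lists the role markers at which generation ends, so the model does not run on into a new turn. Ollama applies these itself; the sketch below only illustrates the effect on a piece of text by cutting it at the earliest stop sequence. The function name `truncate_at_stop` is hypothetical.

```python
# Stop sequences copied from the parameters on this page.
STOP_SEQUENCES = ["<|system|>", "<|user|>", "<|assistant|>"]


def truncate_at_stop(text: str, stops=STOP_SEQUENCES) -> str:
    """Return text up to (not including) the earliest stop sequence, if any."""
    cut = len(text)
    for stop in stops:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)  # keep the earliest match across all stops
    return text[:cut]
```

For example, `truncate_at_stop("Done.<|user|>next question")` keeps only `"Done."`, and text without any stop sequence is returned unchanged.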
