oamazonasgabriel/qwen3.6-35b-a3b:q4-24gbGPU/params

oamazonasgabriel/ qwen3.6-35b-a3b:q4-24gbGPU

822 Downloads Updated 1 month ago

A memory-efficient model configuration of Qwen3.6-35B-A3B using an upstream imatrix-calibrated IQ4_XS quantization and q4_0 KV cache. Designed for 24 GB VRAM

tools thinking

params

a6b253d76a2f · 180B

{

"min_p": 0,

"num_ctx": 16384,

"num_gpu": 99,

"presence_penalty": 1.5,

"repeat_penalty": 1,

"stop": [

"<|im_start|>",

"<|im_end|>"

"temperature": 1,

"top_k": 20,

"top_p": 0.95

}