sharky172/ qwen3.6:27b-mtp-q4_K_M-512k

851 2 weeks ago

Qwen 3.6, MTP-enabled, 512k context (20GB KV cache footprint with OLLAMA_KV_CACHE_TYPE=q8_0)

1217b8c3d63f · 19B
{
"num_ctx": 524288
}