437 1 month ago

An attempt to compress Qwen3.5 into 500M and 1.5B parameters.

tools thinking 500m 1.5b
9371364b27a5 · 65B
{
"presence_penalty": 1.5,
"temperature": 1,
"top_k": 20,
"top_p": 0.95
}