
This model aims to combine the reasoning, coding, and math capabilities of the Qwen3 4B 2507 reasoning model by merging it with several Qwen3 4B fine-tunes. Its reasoning traces are very long.

tools thinking
a9dde3b7ca41 · 122B
{
  "repeat_penalty": 1.1,
  "stop": [
    "<|im_start|>",
    "<|im_end|>"
  ],
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95
}
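
The parameters above are the defaults shipped with the model. As a rough sketch of how they could be supplied or overridden per request, the snippet below builds a payload for Ollama's `/api/chat` endpoint and passes the same values under `options`. The model tag `qwen3-4b-merge` and the helper names `build_chat_request` and `send_chat_request` are hypothetical placeholders, and the host assumes a local Ollama server on its default port.

```python
import json
import urllib.request

# Sampling parameters copied from the params blob above.
PARAMS = {
    "repeat_penalty": 1.1,
    "stop": ["<|im_start|>", "<|im_end|>"],
    "temperature": 0.6,
    "top_k": 20,
    "top_p": 0.95,
}


def build_chat_request(model, prompt, overrides=None):
    """Build a non-streaming /api/chat payload using the default parameters.

    `overrides` is an optional dict of options that replace the defaults.
    """
    options = {**PARAMS, **(overrides or {})}
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "options": options,
        "stream": False,
    }


def send_chat_request(payload, host="http://localhost:11434"):
    """POST the payload to a running Ollama server and return the reply text."""
    req = urllib.request.Request(
        f"{host}/api/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["message"]["content"]


if __name__ == "__main__":
    # "qwen3-4b-merge" is a placeholder; substitute the tag you actually pulled.
    payload = build_chat_request("qwen3-4b-merge", "What is 17 * 23?")
    print(json.dumps(payload, indent=2))
```

Because the reasoning traces are long, it may be worth leaving any output-length limit generous when calling `send_chat_request`; the sketch deliberately does not set one.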