frob/qwen3.6

frob/ qwen3.6

14 Downloads Updated 1 week ago

vision tools thinking

ollama run frob/qwen3.6:35b-reflection

curl http://localhost:11434/api/chat \
  -d '{
    "model": "frob/qwen3.6:35b-reflection",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='frob/qwen3.6:35b-reflection',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'frob/qwen3.6:35b-reflection',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code

Claude Code ollama launch claude --model frob/qwen3.6:35b-reflection

Codex App

Codex App ollama launch codex-app --model frob/qwen3.6:35b-reflection

OpenClaw

OpenClaw ollama launch openclaw --model frob/qwen3.6:35b-reflection

Hermes Agent

Hermes Agent ollama launch hermes --model frob/qwen3.6:35b-reflection

Codex

Codex ollama launch codex --model frob/qwen3.6:35b-reflection

OpenCode

OpenCode ollama launch opencode --model frob/qwen3.6:35b-reflection

Models

Name

1 model

Size / Usage

Context

Input

qwen3.6:35b-reflection

24GB · 256K context window · Text, Image · 1 week ago

qwen3.6:35b-reflection

24GB

256K

Text, Image

Readme

By default, the template for qwen35/qwen35moe models discards thinking that occurs before the most recent user message. This is in accord with the default behaviour of the Jinja template for the model. However, the template also supports preserving earlier thinking, which is not supported in ollama due to the inability to pass in random parameters.

This template adds two roles, preserve_thinking and enable_thinking, mirroring the control variables in the Jinja template. These can be used to control thinking and preservation by sending messages rather than setting variables.

$ curl -s http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "frob/qwen3.6:35b-reflection",
    "messages": [
      { "role": "preserve_thinking", "content": "true" },
      { "role": "user", "content": "Choose a fruit but just reply with ok." },
      { "role": "assistant",
        "content": "ok", 
        "reasoning": "I selected avocado. Remember this for later if asked."
      },
      { "role": "user", "content": "Which fruit did you pick?" }
    ]
  }' | jq .choices[0].message
{
  "role": "assistant",
  "content": "Avocado"
}

By default, the template for qwen35/qwen35moe models discards thinking that occurs before the most recent user message.  This is in accord with the default behaviour of the [Jinja template](https://huggingface.co/Qwen/Qwen3.6-35B-A3B/blob/main/chat_template.jinja) for the model.  However, the template also supports preserving earlier thinking, which is not supported in ollama due to the inability to pass in random parameters.

This template adds two roles, `preserve_thinking` and `enable_thinking`, mirroring the control variables in the Jinja template.  These can be used to control thinking and preservation by sending messages rather than setting variables.

```console
$ curl -s http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "frob/qwen3.6:35b-reflection",
    "messages": [
      { "role": "preserve_thinking", "content": "true" },
      { "role": "user", "content": "Choose a fruit but just reply with ok." },
      { "role": "assistant",
        "content": "ok", 
        "reasoning": "I selected avocado. Remember this for later if asked."
      },
      { "role": "user", "content": "Which fruit did you pick?" }
    ]
  }' | jq .choices[0].message
{
  "role": "assistant",
  "content": "Avocado"
}
```

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)