29 Downloads Updated 9 months ago
ollama run sivab14/granite-thinking
ollama launch claude --model sivab14/granite-thinking
ollama launch codex --model sivab14/granite-thinking
ollama launch opencode --model sivab14/granite-thinking
ollama launch openclaw --model sivab14/granite-thinking
The IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities. These models deliver significant gains on benchmarks for measuring generic performance including AlpacaEval-2.0 and Arena-Hard, and improvements in mathematics, coding, and instruction following. They also supports Fill-in-the-Middle (FIM) for code completion tasks and structured reasoning.
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese. However, users may finetune this Granite model for languages beyond these 12 languages. Intended Use
These models are designed to handle general instruction-following tasks and can be integrated into AI assistants across various domains, including business applications.
To enable thinking, add a message with “role”: “control” and set “content” to “thinking”. For example:
{
"messages": [
{"role": "control", "content": "thinking"},
{"role": "user", "content": "How do I get to the airport if my car won't start?"}
]
}
In this model 8B parameter model, I’ve modified the modelfile to enable thinking always, so it doesn’t need to be explicitly mentioned by the user to use it. I have only made that change, the model is created and fine-tuned by IBM, so the full credits go to them.😇