82 1 week ago

tools thinking

1 week ago

dc56d3bc9325 · 6.7GB ·

qwen3
·
8.19B
·
Q6_K
{{- $lastUserIdx := -1 -}} {{- range $idx, $msg := .Messages -}} {{- if eq $msg.Role "user" }}{{ $la
{ "presence_penalty": 1, "temperature": 0.6 }

Readme

T-lite-it-2.1

🚨 Users are advised to exercise caution and are responsible for any additional training and oversight required to ensure the model’s responses meet acceptable ethical and safety standards. The responsibility for incorporating this model into industrial or commercial solutions lies entirely with those who choose to deploy it.

Description

T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and adds support for tool-calling capabilities — a key advancement over T-lite-it-1.0, which lacks tool-use support. Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.

More info available here: https://huggingface.co/t-tech/T-lite-it-2.1

Available quantisations

All available quantisations are presented here: https://ollama.com/t-tech/T-lite-it-2.1/tags

Quickstart

ollama run t-tech/T-lite-it-2.1:q4_K_M