82 Downloads Updated 1 week ago
🚨 Users are advised to exercise caution and are responsible for any additional training and oversight required to ensure the model’s responses meet acceptable ethical and safety standards. The responsibility for incorporating this model into industrial or commercial solutions lies entirely with those who choose to deploy it.
T-lite-it-2.1 is an efficient Russian model built upon the Qwen 3 architecture, featuring significant improvements in instruction following and adds support for tool-calling capabilities — a key advancement over T-lite-it-1.0, which lacks tool-use support. Outperforms Qwen3-8B in tool calling scenarios, which is essential for agentic applications. Built for both general tasks and complex workflows, with higher Russian text generation throughput enabled by optimized tokenizer.
More info available here: https://huggingface.co/t-tech/T-lite-it-2.1
All available quantisations are presented here: https://ollama.com/t-tech/T-lite-it-2.1/tags
ollama run t-tech/T-lite-it-2.1:q4_K_M