ollama run guzesqdro/Claude_Sonnet_4.6_Reduced
ollama launch claude --model guzesqdro/Claude_Sonnet_4.6_Reduced
ollama launch openclaw --model guzesqdro/Claude_Sonnet_4.6_Reduced
ollama launch hermes --model guzesqdro/Claude_Sonnet_4.6_Reduced
ollama launch codex --model guzesqdro/Claude_Sonnet_4.6_Reduced
ollama launch opencode --model guzesqdro/Claude_Sonnet_4.6_Reduced
This is a lightweight Claude Sonnet-style assistant designed to run quickly on older or low-end devices. It prioritizes low latency and reduced memory usage while aiming to keep natural language output quality high.
The system is powered by a Qwen 2.5-based architecture, fine-tuned and adapted to deliver concise, coherent, and context-aware responses. It has been optimized to reduce latency and computational load, making it suitable for environments where full-scale large language models may be too resource-intensive.
Rather than replicating any specific proprietary model, this version is inspired by modern conversational AI assistants and focuses on delivering a similar user experience in terms of clarity, reasoning ability, and helpfulness, while remaining lightweight and efficient.
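Since the model is meant for constrained hardware, it may help to cap the context window and reply length when calling it programmatically. The sketch below is one way to do that through Ollama's local REST endpoint (`/api/generate`), assuming a server is running on the default port 11434. The helper names `build_request` and `generate` and the default limits (2048-token context, 256-token reply) are illustrative choices, not values recommended by the model's author.

```python
import json
import urllib.request

MODEL = "guzesqdro/Claude_Sonnet_4.6_Reduced"
# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, num_ctx=2048, num_predict=256):
    """Build a non-streaming /api/generate payload with modest resource limits.

    num_ctx caps the context window and num_predict caps the reply length;
    the defaults here are illustrative, not tuned for this model.
    """
    return {
        "model": MODEL,
        "prompt": prompt,
        "stream": False,
        "options": {"num_ctx": num_ctx, "num_predict": num_predict},
    }


def generate(prompt, **limits):
    """Send the prompt to the local Ollama server and return the reply text."""
    body = json.dumps(build_request(prompt, **limits)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With the server running and the model pulled, `generate("Explain what a context window is.", num_ctx=1024)` would return the reply as a string. Lowering `num_ctx` generally reduces memory use, at the cost of how much earlier text the model can take into account.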
Made with ❤️ by guzesqdro 🥳