13 1 week ago

Low latency instruct LLM by JetBrains

tools thinking
5a9b6192ddcc · 188B
Apache License 2.0
Source model: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking
GGUF BF16 quantization: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-BF16