98 1 week ago

Low latency instruct LLM by JetBrains

tools thinking
06c639a08cb4 · 188B
Apache License 2.0
Source model: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking
GGUF Q8_0 quantization: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-Q8_0