21 1 week ago

Low latency instruct LLM by JetBrains

tools thinking
e9df9d062b74 · 198B
Apache License 2.0
Source model: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking
GGUF MXFP4_MOE quantization: https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-MXFP4_MOE