A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.

Note: this model requires Ollama 0.5.5 or later.
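For reference, a minimal usage sketch with the `ollama` Python library. The model tag `deepseek-v3` is an assumption for illustration; substitute the tag shown on this model's page.

```python
# Minimal sketch using the ollama Python library (pip install ollama).
# Assumes Ollama 0.5.5+ is running locally and the model has already been
# pulled, e.g. `ollama pull deepseek-v3` (tag is an assumption, not
# confirmed by this page).
import ollama

response = ollama.chat(
    model="deepseek-v3",  # assumed tag; replace with this model's tag
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(response["message"]["content"])
```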