mistral-nemo:12b-instruct-2407-q5

mistral-nemo:12b-instruct-2407-q5_1

2.8M Downloads Updated 3 months ago

A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

tools 12b

Updated 3 months ago

3 months ago

8898d5e38428 · 9.2GB ·

model

archllama

parameters12.2B

quantizationQ5_1

9.2GB

template

{{- range $i, $_ := .Messages }} {{- if eq .Role "user" }} {{- if and $.Tools (le (len (slice $.Mess

683B

license

Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US

11kB

params

{ "stop": [ "[INST]", "[/INST]" ] }

30B

Readme

Mistral NeMo is a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.

Reference

Blog

Hugging Face