mistral-nemo:12b-instruct-2407-q5_K_S

3M Downloads Updated 4 months ago

A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

tools 12b

40374746451f · 8.5GB

model

arch llama

parameters 12.2B

quantization Q5_K_S

8.5GB

license

Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US

11kB

params

{ "stop": [ "[INST]", "[/INST]" ] }
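
The stop parameter above lists strings at which generation ends. The sketch below illustrates that behavior under the usual assumption that output is cut at the earliest stop sequence found. The helper truncate_at_stop is hypothetical and written only for illustration; it is not how Ollama implements stopping.

```python
import json

# The params blob shown on this page, parsed as JSON.
PARAMS = json.loads('{ "stop": [ "[INST]", "[/INST]" ] }')


def truncate_at_stop(text: str, stops: list[str]) -> str:
    """Cut text at the earliest occurrence of any stop sequence.

    Hypothetical helper for illustration; not part of Ollama.
    """
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    raw = "Paris is the capital of France.[INST] next turn"
    print(truncate_at_stop(raw, PARAMS["stop"]))
```

With these two stop strings, a reply that starts to emit the instruction markers of the next turn is cut off before them.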

30B

template

{{- range $i, $_ := .Messages }} {{- if eq .Role "user" }} {{- if and $.Tools (le (len (slice $.Mess

683B

Readme

Mistral NeMo is a 12B model built by Mistral AI in collaboration with NVIDIA. It offers a large context window of up to 128k tokens, and its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. Because it relies on a standard architecture, Mistral NeMo is easy to use and works as a drop-in replacement in any system using Mistral 7B.
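
As a rough sketch of what using this tag could look like, the Python below builds a chat request for a locally running Ollama server. It assumes the server listens on its default address (http://localhost:11434) and that the tag has already been pulled. The function names build_chat_request and chat are illustrative, not part of any library.

```python
import json
import urllib.request

MODEL = "mistral-nemo:12b-instruct-2407-q5_K_S"  # tag from this page
OLLAMA_URL = "http://localhost:11434/api/chat"   # assumption: default local Ollama address


def build_chat_request(prompt: str) -> dict:
    """Build the JSON body for a single-turn, non-streaming chat request."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one complete JSON reply instead of chunks
    }


def chat(prompt: str) -> str:
    """Send the request to the local server and return the assistant's text.

    Requires a running Ollama server; raises URLError otherwise.
    """
    body = json.dumps(build_chat_request(prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        reply = json.loads(resp.read())
    return reply["message"]["content"]
```

Calling chat("Summarize the Apache 2.0 license in one sentence.") would return the model's reply as a string. Swapping in a different model tag only requires changing MODEL.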
