mistral-nemo:12b-instruct-2407-q8_0

1.7M pulls · updated 9 months ago

A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

tools · 12b


b91eec34730f · 13GB

Model: llama architecture · 12.2B parameters · Q8_0 quantization

License: Apache License, Version 2.0 (January 2004)

Params: { "stop": ["[INST]", "[/INST]"] }

Template (truncated preview): {{- range $i, $_ := .Messages }} {{- if eq .Role "user" }} {{- if and $.Tools (le (len (slice $.Mess
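The stop sequences and chat template above ship inside the model package, so they apply automatically. As an illustration only, here is a minimal sketch of overriding them (and other sampling parameters) per request through the options field of Ollama's HTTP API; it assumes a local Ollama server on the default port 11434 and the requests package:

```python
# Minimal sketch: override the packaged stop sequences (and other sampling
# parameters) for a single request via Ollama's HTTP API.
# Assumes a local Ollama server at the default port 11434.
import requests

resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "mistral-nemo:12b-instruct-2407-q8_0",
        "messages": [
            {"role": "user", "content": "List three uses of a 128k-token context window."}
        ],
        "stream": False,
        "options": {
            # Same stop sequences as the packaged params file, repeated here only
            # to show where a per-request override would go.
            "stop": ["[INST]", "[/INST]"],
            "temperature": 0.3,
        },
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["message"]["content"])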

Readme

Mistral NeMo is a 12B model built in collaboration with NVIDIA. It offers a large context window of up to 128k tokens, and its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. Because it relies on a standard transformer architecture, Mistral NeMo is easy to use and a drop-in replacement in any system that uses Mistral 7B.
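As an illustration of the drop-in claim, the sketch below swaps the model name into an existing Ollama chat call; it assumes the official ollama Python client, a running local server, and that the model has already been pulled. It also raises num_ctx, since Ollama loads models with a much smaller default context window than the 128k the model supports (using the full 128k requires substantial memory).

```python
# Sketch: swapping Mistral 7B for Mistral NeMo is a one-line model-name change.
# Assumes the official `ollama` Python client and a locally pulled
# mistral-nemo:12b-instruct-2407-q8_0.
import ollama

MODEL = "mistral-nemo:12b-instruct-2407-q8_0"  # previously e.g. a Mistral 7B instruct tag

response = ollama.chat(
    model=MODEL,
    messages=[
        {
            "role": "user",
            "content": "In two sentences, explain why a long context window helps "
                       "with retrieval-augmented generation.",
        }
    ],
    # Raise the context window beyond Ollama's default; 128k is possible with enough memory.
    options={"num_ctx": 32768},
)
print(response["message"]["content"])
```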

[Figure: Mistral NeMo base model performance comparison (nemo-base-performance.png)]

Reference

Blog

Hugging Face