nemotron

600.6K Downloads · Updated 1 year ago

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

tools 70b

CLI

ollama run nemotron

cURL

curl http://localhost:11434/api/chat \
  -d '{
    "model": "nemotron",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Python

from ollama import chat

response = chat(
    model='nemotron',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

JavaScript

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'nemotron',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)
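For environments without the ollama client library, the same chat request can be sketched with the Python standard library alone. This is a minimal sketch, assuming an Ollama server is listening on the default local address used in the cURL example; the helper names build_chat_payload and chat_once are illustrative and not part of any Ollama API. Setting "stream" to false asks the server for one complete JSON reply instead of a sequence of chunks.

```python
import json
import urllib.request

# Default local endpoint, as used in the cURL example.
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def build_chat_payload(prompt, model="nemotron", stream=False):
    """Build the JSON body for /api/chat (hypothetical helper)."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        # stream=False requests a single complete response object.
        "stream": stream,
    }


def chat_once(prompt, url=OLLAMA_CHAT_URL):
    """Send one user message and return the assistant's reply text.

    Requires a running Ollama server with the model already pulled.
    """
    body = json.dumps(build_chat_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        reply = json.loads(response.read().decode("utf-8"))
    return reply["message"]["content"]


# Example call (needs a live server, so it is left commented out):
# print(chat_once("Hello!"))
```

Keeping the payload construction separate from the network call makes it easy to inspect or log the request body before sending it.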

Applications

Claude Code: ollama launch claude --model nemotron

OpenCode: ollama launch opencode --model nemotron

Hermes Agent: ollama launch hermes --model nemotron

OpenClaw: ollama launch openclaw --model nemotron

Models

17 models

Name                   Size   Context  Input  Updated
nemotron:latest        43GB   128K     Text   1 year ago
nemotron:70b (latest)  43GB   128K     Text   1 year ago

Readme

The model was trained with RLHF (specifically, REINFORCE), using Llama-3.1-Nemotron-70B-Reward as the reward model and HelpSteer2-Preference prompts, with Llama-3.1-70B-Instruct as the initial policy.
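To give a feel for what a REINFORCE update does, here is a toy sketch on a three-option bandit. It is an illustration only and not NVIDIA's training setup: the fixed reward values stand in for reward-model scores, the "policy" is a softmax over three logits rather than a language model, and the function name, learning rate, and baseline rate are all assumptions chosen for the example.

```python
import math
import random


def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_bandit(rewards, steps=2000, lr=0.1, seed=0):
    """Run REINFORCE on a bandit whose arms pay fixed rewards.

    `rewards` is a hypothetical stand-in for reward-model scores.
    Returns the final action probabilities.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(rewards)
    baseline = 0.0  # running average reward, used to reduce variance

    for _ in range(steps):
        probs = softmax(logits)
        # Sample an action from the current policy.
        action = rng.choices(range(len(rewards)), weights=probs)[0]
        reward = rewards[action]
        advantage = reward - baseline
        baseline += 0.05 * (reward - baseline)

        # Gradient of log pi(action) w.r.t. logit i is
        # 1[i == action] - probs[i]; scale it by the advantage.
        for i in range(len(logits)):
            grad_log_prob = (1.0 if i == action else 0.0) - probs[i]
            logits[i] += lr * advantage * grad_log_prob

    return softmax(logits)


final_probs = reinforce_bandit([0.2, 0.9, 0.5])
print([round(p, 3) for p in final_probs])
```

Actions that score above the running baseline become more likely and those below it become less likely, so probability mass shifts toward the highest-reward option over time.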

$ ollama run nemotron
>>> How many r in strawberry?
A sweet question!

Let's count the "R"s in "strawberry":

1. S
2. T
3. R
4. A
5. W
6. B
7. E
8. R
9. R
10. Y

There are **3** "R"s in the word "strawberry".
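The model's count can be checked independently with a few lines of Python. This is a small sketch with illustrative helper names (count_letter and letter_positions are not from the original page); positions are 1-based to match the numbered list in the transcript.

```python
def count_letter(word, letter):
    """Count case-insensitive occurrences of a letter in a word."""
    target = letter.lower()
    return sum(1 for ch in word.lower() if ch == target)


def letter_positions(word, letter):
    """Return the 1-based positions where the letter occurs."""
    target = letter.lower()
    return [i for i, ch in enumerate(word.lower(), start=1) if ch == target]


print(count_letter("strawberry", "r"))      # 3
print(letter_positions("strawberry", "r"))  # [3, 8, 9]
```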

Reference

Hugging Face