deepseek-llm:67b-base-q5

deepseek-llm:67b-base-q5_0

1.2M Downloads Updated 2 years ago

An advanced language model crafted with 2 trillion bilingual tokens.

7b 67b

ollama run deepseek-llm:67b-base-q5_0

curl http://localhost:11434/api/chat \
  -d '{
    "model": "deepseek-llm:67b-base-q5_0",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='deepseek-llm:67b-base-q5_0',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'deepseek-llm:67b-base-q5_0',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

2 years ago

17e77ba3d4e0 · 46GB ·

model

archllama

parameters67.4B

quantizationQ5_0

46GB

params

{ "num_ctx": 4096 }

17B

Readme

DeepSeek LLM is an advanced language model available in both 7 billion and 67 billion parameters. Both a chat and base variation are available.

Superior General Capabilities: DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.
Proficient in Coding and Math: DeepSeek LLM 67B Chat exhibits outstanding performance in coding (using the HumanEval benchmark) and mathematics (using the GSM8K benchmark).

References

GitHub

HuggingFace