
Readme

This build is configured with a longer sequence length. I recommend running with flash attention and KV-cache quantization if you run out of VRAM.
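
If you serve the model through Ollama, flash attention and KV-cache quantization are controlled by the `OLLAMA_FLASH_ATTENTION` and `OLLAMA_KV_CACHE_TYPE` environment variables on the server. As an illustration only, the sketch below shows equivalent settings when loading a GGUF directly with llama-cpp-python; the model path, context length, and quantization type are placeholder assumptions, not values from this card.

```python
# Hedged sketch: load a GGUF with llama-cpp-python, enable flash attention,
# and quantize the KV cache to cut VRAM use at long context.
# The path, n_ctx, and q8_0 choice are illustrative assumptions.
from llama_cpp import Llama, GGML_TYPE_Q8_0

llm = Llama(
    model_path="model.gguf",   # placeholder path to the GGUF weights
    n_ctx=32768,               # longer sequence length; adjust to this model's configured value
    n_gpu_layers=-1,           # offload all layers to the GPU
    flash_attn=True,           # flash attention reduces attention memory overhead
    type_k=GGML_TYPE_Q8_0,     # quantize the K cache to 8-bit
    type_v=GGML_TYPE_Q8_0,     # V-cache quantization requires flash_attn=True
)

out = llm("Q: Why quantize the KV cache? A:", max_tokens=64)
print(out["choices"][0]["text"])
```

The trade-off: an 8-bit (q8_0) KV cache takes roughly half the memory of the default f16 cache, typically with little quality loss, which is what lets the longer context fit in limited VRAM.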