Japanese instruction-tuned LLM by CyberAgent, distilled from Qwen-72B.

ollama run wao/DeepSeek-R1-Distill-Qwen-32B-Japanese

curl http://localhost:11434/api/chat \
  -d '{
    "model": "wao/DeepSeek-R1-Distill-Qwen-32B-Japanese",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='wao/DeepSeek-R1-Distill-Qwen-32B-Japanese',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'wao/DeepSeek-R1-Distill-Qwen-32B-Japanese',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

9c527286bf42 · 35GB ·

model

archqwen2

parameters32.8B

quantizationQ8_0

35GB

system

You are a friendly assistant.

29B

license

./LICENCE

Readme

DeepSeek-R1-Distill-Qwen-32B-Japanese (GGUF)

🧠 Japanese instruction-tuned LLM by CyberAgent, distilled from Qwen-72B.

🔹 Model Overview

Architecture: Qwen (transformer-based)
Size: 32B parameters (distilled)
Context length: 4096 tokens
Language: Japanese (native), English (partial)
Format: GGUF, quantized (e.g. q8_0)

🔹 Source

https://huggingface.co/cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese

🔹 License

License: MIT
Provider: CyberAgent, Inc.
See full terms: LICENSE

🔹 Tags

japanese, 32b, qwen, cyberagent, ollama, instruction-tuned

🔹 Notes

This model is suitable for Japanese-language instruction tasks, summarization, QA, etc.