757 10 months ago

long context 10 million token context window size llama 4 scout model

vision tools
ollama run tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M

Applications

Claude Code
Claude Code ollama launch claude --model tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M
Codex
Codex ollama launch codex --model tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M
OpenCode
OpenCode ollama launch opencode --model tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M
OpenClaw
OpenClaw ollama launch openclaw --model tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M

Models

View all →

Readme

downloaded from https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct converted and quantized using v0.6.7-rc1

  • Updated system prompt to use chain of thought
  • 1.8TB of memory is required for 10M size. You could swap to disk …..