757 pulls · 10 months ago

Long context: a Llama 4 Scout model with a 10-million-token context window.

Tags: vision · tools
ollama run tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M
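The model ships with a 4096-token default context. One way to run it with a larger window is a custom Modelfile; this is a sketch, assuming the tag above has already been pulled locally, and `num_ctx 32768` is an illustrative value (memory use grows with context size):

```
# Modelfile: derive a larger-context variant of this model
FROM tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M

# Raise the context window from the default 4096 (illustrative value)
PARAMETER num_ctx 32768
```

Build and run it with `ollama create scout-32k -f Modelfile` followed by `ollama run scout-32k` (`scout-32k` is a hypothetical name).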

Details


f5e81ab317cf · 67GB

Architecture: llama4 · Parameters: 109B · Quantization: Q4_K_M

Default parameters: { "num_ctx": 4096, "temperature": 0.1, "top_p": 0.9 }
Template (truncated):
{{- if .System }}<|header_start|>system<|header_end|> {{- with .Tools }}Environment: ipython You hav…

System prompt (truncated):
You are a helpful and intelligent AI assistant. For every problem you solve, always explain your rea…
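The defaults above can also be overridden per request through the Ollama REST API (`POST /api/generate` with an `options` object). A minimal sketch, assuming a local Ollama server on the default port; `num_ctx` 32768 and the prompt are illustrative values:

```python
import json

# Build a request that overrides the model's default generation parameters.
# The "options" keys mirror the defaults listed above; num_ctx 32768 is an
# illustrative assumption -- memory use grows with context size.
payload = {
    "model": "tukia/llama-4-Scout-17b-16e-Instruct-q4_K_M",
    "prompt": "Summarize the following document: ...",
    "stream": False,
    "options": {
        "num_ctx": 32768,       # raise from the default 4096
        "temperature": 0.1,
        "top_p": 0.9,
    },
}

# Send with any HTTP client, e.g.:
#   requests.post("http://localhost:11434/api/generate", json=payload)
body = json.dumps(payload)
print(body[:50])
```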

Readme

Downloaded from https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct, then converted and quantized using v0.6.7-rc1.

  • Updated the system prompt to use chain-of-thought reasoning.
  • Roughly 1.8TB of memory is required for the full 10M-token context window. You could swap to disk …
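The 1.8TB figure is consistent with a back-of-envelope KV-cache calculation. The hyperparameters below are assumptions about Llama 4 Scout's attention configuration (48 layers, 8 KV heads, head dimension 128, fp16 cache); adjust them if the published config differs:

```python
# Back-of-envelope KV-cache sizing for a 10M-token context.
# All hyperparameters are assumed values, not taken from this model card.
n_layers = 48
n_kv_heads = 8
head_dim = 128
bytes_per_elem = 2                      # fp16 cache
kv_factor = 2                           # one K and one V tensor per layer

bytes_per_token = n_layers * n_kv_heads * head_dim * bytes_per_elem * kv_factor
context_tokens = 10_000_000
total_tib = bytes_per_token * context_tokens / 2**40

print(f"{bytes_per_token // 1024} KiB per token")   # 192 KiB
print(f"{total_tib:.2f} TiB for 10M tokens")        # ~1.79 TiB, matching the quoted ~1.8TB
```

At roughly 192 KiB of cache per token, context length, not model weights (67GB here), dominates memory at the 10M-token extreme.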