149 1 month ago

16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.

tools thinking

1 month ago

b37cd58f6fd3 · 5.2GB

qwen3
·
8.19B
·
Q4_K_M
MIT License Copyright (c) 2023 DeepSeek Permission is hereby granted, free of charge, to any person
{{- $lastUserIndex := -1 }} {{- $hasActiveToolCall := false }} {{- range $index, $_ := .Messages }}
# Devstral - Advanced Coding Assistant System Prompt You are Devstral, an elite coding assistant eng
{ "num_ctx": 16000, "seed": 42, "stop": [ "<|begin▁of▁sentence|>",

Readme

No readme