Details

Updated 7 hours ago

7 hours ago

627bcac0f0de · 5.6GB ·

model

archqwen35

parameters8.95B

quantizationQ4_K_M

5.6GB

template

13B

system

/nothink You are a coding agent running inside Open Code. Be concise. Avoid loops. Use tools only wh

250B

license

Credits to Jackrong. Original model: Qwen3.5 9B DeepSeek V4 Flash

67B

params

{ "num_ctx": 64000, "repeat_last_n": 2048, "repeat_penalty": 1.12, "seed": 79, "

112B

OCode79 - 9B param, Q4, 64K ctx, Local/Offline, 16GB (or 2x 8GB) GPU

Custom Ollama model, fine-tuned from Qwen3.5-9B DeepSeek V4 Flash from Jackrong, configured for local coding-agent workflows, especially with Open Code.

This model is based on a 9B parameter LLM, quantized in Q4, and configured with a large context window for software development tasks. It is intended to provide a practical balance between performance, memory usage, and code-assistance quality on local hardware.

Model details

Type: Text/image model
Size: 9B parameters
Quantization: Q4
Context target: 64K
Real GPU memory usage: 9,6 GB VRAM
Recommended GPU memory: 16 GB VRAM
Main focus: Coding and agentic development workflows
Tool use: Supported, depending on the client/application
Thinking/reasoning mode: Supported, depending on the client/application

Intended use

This model is designed for:

Open Code workflows
Local coding assistants
Code analysis
Debugging support
Refactoring suggestions
Project exploration
Terminal-based programming tasks
Educational demonstrations of AI coding agents

Custom model for Open Code to use locally with 16gb or 2x8gb GPUs (working fine?...)

Details

Readme

OCode79 - 9B param, Q4, 64K ctx, Local/Offline, 16GB (or 2x 8GB) GPU

Model details

Intended use