35 3 weeks ago

My (Gökdeniz Gülmez) first reasoning model fine-tuned on a custom distill dataset

tools thinking
ollama run goekdenizguelmez/Josie-R1

Applications

Claude Code
Claude Code ollama launch claude --model goekdenizguelmez/Josie-R1
Codex
Codex ollama launch codex --model goekdenizguelmez/Josie-R1
OpenCode
OpenCode ollama launch opencode --model goekdenizguelmez/Josie-R1
OpenClaw
OpenClaw ollama launch openclaw --model goekdenizguelmez/Josie-R1

Models

View all →

Readme

JOSIE-R1-4B

The JOSIE-R1-4B models are reasoning-oriented language models trained on a custom distillation dataset derived from the original Josie-RL-Zero-V1 model. The training process emphasizes instruction fidelity, reasoning consistency, and practical usefulness across a wide range of everyday and technical tasks.

Model Card for Goekdeniz-Guelmez/JOSIE-R1-4B

Model Description

Introducing JOSIE-R1-4B, a new member of the JOSIE family, fine-tuned with a strong focus on openness, instruction alignment, and transparent reasoning behavior. The model is designed to follow user intent closely while retaining the expressive and occasionally unconventional character that defines the Josie lineage.

Benchmarks comming soon!

  • Developed by: Goekdeniz-Guelmez
  • Funded by: Goekdeniz-Guelmez
  • Shared by: Goekdeniz-Guelmez
  • Model type: qwen3
  • Finetuned from model: Qwen/Qwen3-4B