18 6 days ago

Fine-tuned from Phi-3-mini-4k-instruct on the Magicoder-OSS-Instruct-75K dataset (2K Python examples). Designed for coding Q&A — explains programming concepts with examples. Runs efficiently on consumer hardware via 4-bit QLoRA.

ollama run hanneselundstrom/phi-coding-instructor

Models

View all →

Readme

phi-coding-instructor

Fine-tuned from Phi-3-mini-4k-instruct on the Magicoder-OSS-Instruct-75K dataset (2K Python examples) using QLoRA.

Fine-tuned on a single T4 GPU via Google Colab. ~30 minutes training time.

Training details

  • Base model: unsloth/Phi-3-mini-4k-instruct
  • Quantization: 4-bit (Q4_K_M)
  • LoRA rank: 16
  • Learning rate: 2e-4
  • Batch size: 8 (2 per device × 4 gradient accumulation)
  • Steps: 500
  • Dataset: 2,000 Python coding instruction/response pairs

Example

What is multi-head attention? → Explains the concept with context about Transformer architecture

Running locally

ollama run hanneselundstrom/phi-coding-instructor