18 6 days ago

Fine-tuned from Phi-3-mini-4k-instruct on the Magicoder-OSS-Instruct-75K dataset (2K Python examples). Designed for coding Q&A — explains programming concepts with examples. Runs efficiently on consumer hardware via 4-bit QLoRA.

ollama run hanneselundstrom/phi-coding-instructor

Details

6 days ago

6aaceff6feb0 · 2.3GB ·

llama
·
3.82B
·
Q4_K_M
{{ if .System }}<|system|> {{ .System }}<|end|> {{ end }}{{ if .Prompt }}<|user|> {{ .Prompt }}<|end
{ "min_p": 0.1, "stop": [ "<|end|>", "<|user|>", "<|assistant|>"

Readme

phi-coding-instructor

Fine-tuned from Phi-3-mini-4k-instruct on the Magicoder-OSS-Instruct-75K dataset (2K Python examples) using QLoRA.

Fine-tuned on a single T4 GPU via Google Colab. ~30 minutes training time.

Training details

  • Base model: unsloth/Phi-3-mini-4k-instruct
  • Quantization: 4-bit (Q4_K_M)
  • LoRA rank: 16
  • Learning rate: 2e-4
  • Batch size: 8 (2 per device × 4 gradient accumulation)
  • Steps: 500
  • Dataset: 2,000 Python coding instruction/response pairs

Example

What is multi-head attention? → Explains the concept with context about Transformer architecture

Running locally

ollama run hanneselundstrom/phi-coding-instructor