6 2 days ago

A 7B math reasoning model from Allen AI, trained with RL-Zero to solve problems step-by-step like a skilled tutor. Supports 65K context for complex multi-step problems - runs on any laptop.

2 days ago

b99e24b18a54 ยท 3.5GB ยท

olmo2
ยท
7.3B
ยท
(!unknown_file_type 27!)
{{ .Prompt }}
You are a helpful AI assistant specialized in mathematics. Solve problems step by step, showing your
# OLMo-3-7B-RLZero-Math A 7B parameter model fine-tuned for mathematical reasoning using the RL-Zero
{ "num_ctx": 65536, "stop": [ "<|endoftext|>" ] }

Readme

OLMo-3-7B-RLZero-Math: Open-Source Mathematical Reasoning AI

## ๐Ÿš€ Overview

OLMo-3-7B-RLZero-Math is a 7 billion parameter model fine-tuned specifically for mathematical reasoning using the innovative RL-Zero approach from Allen Institute for AI. It excels at step-by-step problem solving, showing its work clearly like a skilled tutor.

## ๐ŸŽฏ Key Features

  • 65,536 token context - Extended context with YaRN scaling for complex multi-step problems
  • RL-Zero training - Reinforcement learning from scratch without human demonstrations
  • Step-by-step reasoning - Trained to show work and explain mathematical logic
  • Fully open source - Apache 2.0 license, weights and training code available

## ๐Ÿท๏ธ Available Versions

| Tag | Size | RAM Required | Description | |โ€”โ€“|โ€”โ€”|โ€”โ€”โ€”โ€”โ€“|โ€”โ€”โ€”โ€”-| | iq3_m | 3.2 GB | ~4GB | Smallest, for constrained devices | | iq4_xs | 3.8 GB | ~5GB | Compact 4-bit with good quality | | q4_k_m | 4.2 GB | ~6GB | Recommended - best quality/size ratio | | q5_k_m | 4.9 GB | ~7GB | Higher quality, slightly larger | | q8_0 | 7.2 GB | ~9GB | Near full precision | | f16 | 13.6 GB | ~16GB | Full FP16 precision |

## ๐Ÿ’ป Quick Start

  # Recommended version (Q4_K_M)
  ollama run richardyoung/olmo-3-7b-rlzero-math "Solve for x: 3x^2 - 12x + 9 = 0"

  # Smallest version for mobile/edge (IQ3_M)
  ollama run richardyoung/olmo-3-7b-rlzero-math:iq3_m "What is the derivative of sin(x)?"

  # High quality version (Q8_0)
  ollama run richardyoung/olmo-3-7b-rlzero-math:q8_0 "Prove that the sum of the first n odd numbers equals nยฒ"

## ๐Ÿ› ๏ธ Example Use Cases

### Algebra

  ollama run richardyoung/olmo-3-7b-rlzero-math "Solve the system of equations: 2x + 3y = 12, x - y = 1"

### Calculus

  ollama run richardyoung/olmo-3-7b-rlzero-math "Find the derivative of f(x) = xยณ ยท ln(x)"

### Word Problems

  ollama run richardyoung/olmo-3-7b-rlzero-math "A train leaves Station A at 60 mph. Another train leaves Station B
  (300 miles away) at 40 mph heading toward A. When do they meet?"

### Proofs

  ollama run richardyoung/olmo-3-7b-rlzero-math "Prove by induction that 1 + 2 + 3 + ... + n = n(n+1)/2"

## ๐Ÿ“‹ System Requirements

### Minimum Requirements - RAM: 8GB - CPU: Any modern processor - Storage: 5GB free space

### Recommended Setup - RAM: 16GB - Hardware: Apple Silicon Mac or NVIDIA GPU - Storage: 15GB for multiple quantizations

## ๐ŸŒŸ What Makes This Model Special

  1. RL-Zero Training: Learned mathematical reasoning through reinforcement learning without human demonstrations
  2. Extended Context: 65K tokens allows complex multi-step problems
  3. Clear Explanations: Trained to show work step-by-step
  4. Efficient: Runs on consumer hardware including laptops

## ๐Ÿค Resources & Links

## ๐Ÿ™ Acknowledgments

  • Allen Institute for AI (Ai2) for creating this exceptional model
  • llama.cpp community for quantization tools
  • Quantization: Richard Young (deepneuro.ai)

## ๐Ÿ“ License

Apache 2.0 - Free for commercial and personal use.