A 7B math reasoning model from Allen AI, trained with RL-Zero to solve problems step-by-step like a skilled tutor. Supports 65K context for complex multi-step problems - runs on any laptop.

Details

Updated 3 months ago

3 months ago

b99e24b18a54 · 3.5GB ·

model

archolmo2

parameters7.3B

quantization(!unknown_file_type 27!)

3.5GB

template

13B

system

You are a helpful AI assistant specialized in mathematics. Solve problems step by step, showing your

119B

license

# OLMo-3-7B-RLZero-Math A 7B parameter model fine-tuned for mathematical reasoning using the RL-Zero

1.7kB

params

{ "num_ctx": 65536, "stop": [ "<|endoftext|>" ] }

53B

OLMo-3-7B-RLZero-Math: Open-Source Mathematical Reasoning AI

## 🚀 Overview

OLMo-3-7B-RLZero-Math is a 7 billion parameter model fine-tuned specifically for mathematical reasoning using the innovative RL-Zero approach from Allen Institute for AI. It excels at step-by-step problem solving, showing its work clearly like a skilled tutor.

## 🎯 Key Features

65,536 token context - Extended context with YaRN scaling for complex multi-step problems
RL-Zero training - Reinforcement learning from scratch without human demonstrations
Step-by-step reasoning - Trained to show work and explain mathematical logic
Fully open source - Apache 2.0 license, weights and training code available

## 🏷️ Available Versions

| Tag | Size | RAM Required | Description | |—–|——|————–|————-| | iq3_m | 3.2 GB | ~4GB | Smallest, for constrained devices | | iq4_xs | 3.8 GB | ~5GB | Compact 4-bit with good quality | | q4_k_m | 4.2 GB | ~6GB | Recommended - best quality/size ratio | | q5_k_m | 4.9 GB | ~7GB | Higher quality, slightly larger | | q8_0 | 7.2 GB | ~9GB | Near full precision | | f16 | 13.6 GB | ~16GB | Full FP16 precision |

## 💻 Quick Start

  # Recommended version (Q4_K_M)
  ollama run richardyoung/olmo-3-7b-rlzero-math "Solve for x: 3x^2 - 12x + 9 = 0"

  # Smallest version for mobile/edge (IQ3_M)
  ollama run richardyoung/olmo-3-7b-rlzero-math:iq3_m "What is the derivative of sin(x)?"

  # High quality version (Q8_0)
  ollama run richardyoung/olmo-3-7b-rlzero-math:q8_0 "Prove that the sum of the first n odd numbers equals n²"

## 🛠️ Example Use Cases

### Algebra

  ollama run richardyoung/olmo-3-7b-rlzero-math "Solve the system of equations: 2x + 3y = 12, x - y = 1"

### Calculus

  ollama run richardyoung/olmo-3-7b-rlzero-math "Find the derivative of f(x) = x³ · ln(x)"

### Word Problems

  ollama run richardyoung/olmo-3-7b-rlzero-math "A train leaves Station A at 60 mph. Another train leaves Station B
  (300 miles away) at 40 mph heading toward A. When do they meet?"

### Proofs

  ollama run richardyoung/olmo-3-7b-rlzero-math "Prove by induction that 1 + 2 + 3 + ... + n = n(n+1)/2"

## 📋 System Requirements

### Minimum Requirements - RAM: 8GB - CPU: Any modern processor - Storage: 5GB free space

### Recommended Setup - RAM: 16GB - Hardware: Apple Silicon Mac or NVIDIA GPU - Storage: 15GB for multiple quantizations

## 🌟 What Makes This Model Special

RL-Zero Training: Learned mathematical reasoning through reinforcement learning without human demonstrations
Extended Context: 65K tokens allows complex multi-step problems
Clear Explanations: Trained to show work step-by-step
Efficient: Runs on consumer hardware including laptops

## 🤝 Resources & Links

Original Model: allenai/OLMo-3-7B-RLZero-Math
GGUF/MLX Downloads: richardyoung/OLMo-3-7B-RLZero-Math-GGUF

## 🙏 Acknowledgments

Allen Institute for AI (Ai2) for creating this exceptional model
llama.cpp community for quantization tools
Quantization: Richard Young (deepneuro.ai)

## 📝 License

Apache 2.0 - Free for commercial and personal use.

A 7B math reasoning model from Allen AI, trained with RL-Zero to solve problems step-by-step like a skilled tutor. Supports 65K context for complex multi-step problems - runs on any laptop.

Details

Readme

OLMo-3-7B-RLZero-Math: Open-Source Mathematical Reasoning AI