63 Downloads Updated 1 year ago
ollama run ImFineThanks/Integrator-1-R1-ZERO-3B
Updated 1 year ago
1 year ago
b4075145a909 · 2.0GB ·
Integrator-1 is a fine-tuned Large Language Model (LLM) optimized for solving calculus integrals. Built on the Llama-3.2-3B-Instruct base model, it uses reinforcement learning and Low-Rank Adaptation (LoRA) to achieve improved accuracy in integral-solving tasks. The project demonstrates the efficacy of domain-specific fine-tuning for enhancing LLM performance in mathematical applications.
The training dataset consists of 10,000 synthetic integrals, generated via a custom Python script. Key features include:
Format: Integrals and solutions are formatted in LaTeX (e.g., \int_{-8}^{8} -x \, dx).
Solutions: Numerically computed and rounded to integers, constrained to the range [-20, 20].
Difficulty Tiers: Three levels of complexity to ensure progressive learning.
Functions generar_integral() and crear_dataset() handle generation and deduplication. The script is included in the repository for reproducibility.
Integrator-1 was fine-tuned using Unsloth’s framework on Google Colab’s free tier. The process involved:
A testing script (test_model()) evaluated Integrator-1 against the base model on a 50-integral test set. Results:
34%66%94% relative increaseThese metrics highlight the benefit of specialization, though further training could enhance performance.
To use Integrator-1, download the model with the following command ollama run ImFineThanks/Integrator-1-R1-ZERO-3B and to test it, follow the instructions in the repository’s scripts.