826 2 months ago

Llama3.2 1b trained on data distilled from gpt4o, claude3.5 and claude opus

tools

Models

View all →

Readme

Llama 3.2 4o Claude

The catsarethebest/llama3.2-4oClaude:latest model was trained on a mix of datasets distilled from GPT-4o, Claude 3.5, and Claude 3.5 Opus.

The catsarethebest/llama3.2-4oClaude:o4-mini version is a fine-tuned variant of :latest, trained further on a smaller, fully synthetic dataset distilled from o4-mini.