50 1 year ago

This is a distilled model trained on the latest Tieba dataset. Training used about 8k samples, with chain-of-thought data from DeepSeek-V3.

ollama run Hanversion/MAGA-T1-Tieba-1.5B-Distill
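
Once the model has been pulled with the command above, it can also be queried from a script. The following is a minimal sketch, assuming a local Ollama server on its default port (11434) and its `/api/generate` endpoint; the helper names `build_request` and `generate` are illustrative and not part of this model's page.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "Hanversion/MAGA-T1-Tieba-1.5B-Distill"


def build_request(prompt, model=MODEL, url=OLLAMA_URL):
    """Build a non-streaming generate request for the Ollama HTTP API."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt):
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt), timeout=120) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    # With "stream": False the server replies with a single JSON object
    # whose "response" field holds the full completion.
    return body["response"]
```

With the server running, `print(generate("Hello"))` would print the model's reply. Setting `"stream": False` keeps the sketch short; a streaming client would instead read one JSON object per line.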
