111 1 month ago

A small and efficient reasoning model, with a hybrid transformer and mamba architecture

tools 3b

Models

View all →

Readme

No readme