574 8 months ago

A small and efficient reasoning model, with a hybrid transformer and mamba architecture

tools 3b