112 1 month ago

A small and efficient reasoning model, with a hybrid transformer and mamba architecture

tools 3b