244 3 months ago

A small and efficient reasoning model, with a hybrid transformer and mamba architecture

tools 3b