75 1 week ago

A Deepseek-R1:8b model with Deepseek-R1:1.5b model as drafting model

thinking