146 1 month ago

16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.

tools thinking

Models

View all →

Readme

No readme