The current, most capable model that runs on a single GPU. With quantization and tools.
1,801 Pulls 3 Tags Updated 11 months ago