The most capable current model that runs on a single GPU, with quantization and tool support.
1,683 pulls · 3 tags · updated 8 months ago