The current most capable model that runs on a single GPU, with quantization and tool support.
1,834 Pulls · 3 Tags · Updated 1 year ago