The current, most capable model that runs on a single GPU, with quantization and tools.
1,618 Pulls · 3 Tags · Updated 6 months ago