| Feature | Value |
|---|---|
| vision | true (>=0.11.11) |
| thinking | false |
| tools | false |

| Device | Speed (tokens/s) | Context | VRAM (GB) | Versions |
|---|---|---|---|---|
| RTX 3090 24 GB | ~30 | 4096 | 17 | IQ4_XS, 0.12.2 |
| RTX 3090 24 GB | ~30 | 15360 | 19 | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~14 | 4096 | 17 | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~13 | 15360 | 18 | IQ4_XS, 0.12.2 |
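
To compare your own hardware against the speed column, one option is to derive tokens per second from the timing fields Ollama returns with a generation response (`eval_count` and `eval_duration`, the latter in nanoseconds). The sketch below is a minimal illustration under that assumption; it is not necessarily how the figures in the table were measured, and the `sample` values are hypothetical.

```python
def tokens_per_second(response: dict) -> float:
    """Generation speed computed from an Ollama response's timing fields."""
    eval_count = response["eval_count"]           # number of generated tokens
    eval_duration_ns = response["eval_duration"]  # generation time in nanoseconds
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration must be positive")
    return eval_count / (eval_duration_ns / 1e9)


# Hypothetical response fragment, not a measurement from the table above.
sample = {"eval_count": 300, "eval_duration": 10_000_000_000}
print(f"{tokens_per_second(sample):.1f} tokens/s")
```

Averaging over several prompts of similar length should give a steadier figure than a single run, since speed varies with context size.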