77 Downloads Updated 1 month ago
| Feature | Value |
|---|---|
| vision | true (>=0.11.11) |
| thinking | false |
| tools | false |
| Device | Speed, token/s | Context | VRAM, gb | Versions |
|---|---|---|---|---|
| RTX 3090 24gb | ~34 | 4096 | 20 | UD-Q4_K_XL,0.12.2 |
| RTX 3090 24gb | - | 15360 | “cudaMalloc failed” | UD-Q4_K_XL,0.12.2 |
| M1 Max 32gb | ~13 | 4096 | 19 | UD-Q4_K_XL,0.12.2 |