Based on https://huggingface.co/lmstudio-community/DeepSeek-Coder-V2-Lite-Instruct-GGUF

| Feature | Value |
|---|---|
| vision | false |
| thinking | false |
| tools | true |

| Device | Speed (tokens/s) | Context (tokens) | VRAM (GB) | Versions |
|---|---|---|---|---|
| RTX 3090 24 GB | ~139 | 4096 | 10 | IQ4_XS, 0.12.2 |
| RTX 3090 24 GB | ~139 | 15360 | 13 | IQ4_XS, 0.12.2 |
| RTX 2080 Ti 11 GB | ~114 | 4096 | 10 | IQ4_XS, 0.12.2 |
| RTX 2080 Ti 11 GB | ~37 | 15360 | 14 (23%/77% CPU/GPU) | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~80 | 4096 | 10 | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~81 | 15360 | 14 | IQ4_XS, 0.12.2 |
| RTX 3070 Ti Mobile 8 GB | ~31 | 4096 | 10 (26%/74% CPU/GPU) | IQ4_XS, 0.12.3 |
| RTX 3070 Ti Mobile 8 GB | ~30 | 15360 | 14 (46%/54% CPU/GPU) | IQ4_XS, 0.12.3 |
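
For the rows that list a CPU/GPU split, the rough arithmetic below may help in reading them. It is only a sketch: it assumes the percentages apply to the total footprint shown in the VRAM column, which the table does not state explicitly, and `split_memory` is a hypothetical helper, not part of any tool.

```python
def split_memory(total_gb: float, cpu_pct: float, gpu_pct: float) -> tuple[float, float]:
    """Return (cpu_gb, gpu_gb) for a footprint split by percentage.

    Assumption: the CPU/GPU percentages describe how the total
    footprint in the table is divided between system RAM and VRAM.
    """
    if abs(cpu_pct + gpu_pct - 100.0) > 1e-6:
        raise ValueError("percentages must sum to 100")
    return total_gb * cpu_pct / 100.0, total_gb * gpu_pct / 100.0


# Rows from the table that report a split: (label, total GB, CPU %, GPU %)
rows = [
    ("RTX 2080 Ti 11 GB, 15360 context", 14, 23, 77),
    ("RTX 3070 Ti Mobile 8 GB, 4096 context", 10, 26, 74),
    ("RTX 3070 Ti Mobile 8 GB, 15360 context", 14, 46, 54),
]

for label, total_gb, cpu_pct, gpu_pct in rows:
    cpu_gb, gpu_gb = split_memory(total_gb, cpu_pct, gpu_pct)
    print(f"{label}: ~{gpu_gb:.1f} GB on GPU, ~{cpu_gb:.1f} GB on CPU")
```

Under that assumption, the GPU share in each split row stays just below the card's VRAM, and the rows with a split are also the ones where speed drops sharply compared with fully GPU-resident runs.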