216 Downloads Updated 5 months ago
New state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. I-Quants models.
tools
Readme
- Quantization from fp32
- Using i-matrix calibration dataset calibration_datav3.txt
- I-Quants models, only if passing tests: iq2_xs, iq2_xxs, iq3_xxs, iq4_nl, iq3_s, iq2_s, iq4_xs, iq3_xs
- Default quantization: iq4_nl
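The list above describes an importance-matrix quantization workflow. As a rough sketch only, the helper below builds the two command lines such a workflow might use, assuming the llama.cpp tools `llama-imatrix` and `llama-quantize` are the ones involved (this README does not say which tools were used). The file names `model-f32.gguf` and `imatrix.dat` are hypothetical placeholders, and the "passing tests" step is not modelled because the README does not describe it.

```python
from pathlib import Path

# I-Quant types listed in this README; iq4_nl is the stated default.
IQUANT_TYPES = [
    "iq2_xs", "iq2_xxs", "iq3_xxs", "iq4_nl",
    "iq3_s", "iq2_s", "iq4_xs", "iq3_xs",
]
DEFAULT_TYPE = "iq4_nl"


def imatrix_command(fp32_model, calibration_file, imatrix_out):
    """Build the command that computes an importance matrix.

    Assumption: llama.cpp's llama-imatrix tool, with -m (model),
    -f (calibration text) and -o (output file).
    """
    return [
        "llama-imatrix",
        "-m", str(fp32_model),
        "-f", str(calibration_file),
        "-o", str(imatrix_out),
    ]


def quantize_command(fp32_model, imatrix_file, quant_type=DEFAULT_TYPE):
    """Build the command that quantizes the fp32 model to one I-Quant type.

    Assumption: llama.cpp's llama-quantize tool, which takes
    --imatrix FILE, then input path, output path and the type name.
    """
    if quant_type not in IQUANT_TYPES:
        raise ValueError(f"unsupported quantization type: {quant_type}")
    src = Path(fp32_model)
    # Hypothetical naming scheme: append the type to the source file stem.
    out = src.with_name(f"{src.stem}-{quant_type}.gguf")
    return [
        "llama-quantize",
        "--imatrix", str(imatrix_file),
        str(src),
        str(out),
        quant_type.upper(),
    ]


if __name__ == "__main__":
    # Print the commands instead of running them; pass them to
    # subprocess.run(..., check=True) once the paths are real.
    print(" ".join(imatrix_command(
        "model-f32.gguf", "calibration_datav3.txt", "imatrix.dat")))
    for qtype in IQUANT_TYPES:
        print(" ".join(quantize_command("model-f32.gguf", "imatrix.dat", qtype)))
```

The importance matrix is computed once from the fp32 model and reused for every quantization type, which is why the two steps are kept as separate functions.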