New state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. I-Quants models.

tools

124 3 months ago

Readme

  • Quantized from fp32
  • Importance matrix (i-matrix) computed with the calibration dataset calibration_datav3.txt
  • I-Quants models, published only if they pass tests: iq2_xs, iq2_xxs, iq3_xxs, iq4_nl, iq3_s, iq2_s, iq4_xs, iq3_xs
  • Default quantization: iq4_nl
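
The README does not say which tooling produced these files. As a rough sketch only, assuming the llama.cpp command-line tools `llama-imatrix` and `llama-quantize` were used, the steps listed above could be scripted as below. The file names `llama-3.3-70b-f32.gguf` and `imatrix.dat` and both helper functions are hypothetical and exist only for illustration. The script builds the command lines and does not run them.

```python
from pathlib import Path

# Quantization types named in this README; iq4_nl is the stated default.
IQUANT_TYPES = ["iq2_xs", "iq2_xxs", "iq3_xxs", "iq4_nl",
                "iq3_s", "iq2_s", "iq4_xs", "iq3_xs"]
DEFAULT_TYPE = "iq4_nl"


def imatrix_command(fp32_gguf, calibration_file="calibration_datav3.txt",
                    imatrix_out="imatrix.dat"):
    """Build the argv for computing an importance matrix (assumed llama.cpp tool)."""
    return ["llama-imatrix", "-m", str(fp32_gguf),
            "-f", str(calibration_file), "-o", str(imatrix_out)]


def quantize_command(fp32_gguf, quant_type=DEFAULT_TYPE, imatrix="imatrix.dat"):
    """Build the argv for one I-Quant conversion (assumed llama.cpp tool)."""
    if quant_type not in IQUANT_TYPES:
        raise ValueError(f"unsupported quantization type: {quant_type}")
    src = Path(fp32_gguf)
    # Hypothetical naming scheme: append the quant type to the source file stem.
    out = src.with_name(f"{src.stem}-{quant_type}.gguf")
    return ["llama-quantize", "--imatrix", str(imatrix),
            str(src), str(out), quant_type.upper()]


if __name__ == "__main__":
    source = "llama-3.3-70b-f32.gguf"  # hypothetical fp32 input file
    print(" ".join(imatrix_command(source)))
    for qtype in IQUANT_TYPES:
        print(" ".join(quantize_command(source, qtype)))
```

Each printed line can be reviewed and then run by hand; the "only if passing tests" step from the list is not modeled here because the README does not describe those tests.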



Model reference