New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model. I-Quants models.
tools
124 Pulls Updated 3 months ago
Updated 4 months ago
4 months ago
6896dd6fb8b4 · 40GB
model
archllama
·
parameters70.6B
·
quantizationIQ4_NL
40GB
params
{
"stop": [
"<|start_header_id|>",
"<|end_header_id|>",
"<|eot_id|>"
96B
template
{{- if or .System .Tools }}<|start_header_id|>system<|end_header_id|>
{{- if .System }}
{{ .System
1.5kB
license
Llama 3.3 Acceptable Use Policy
Meta is committed to promoting safe and fair use of its tools and f
5.6kB
license
LLAMA 3.3 COMMUNITY LICENSE AGREEMENT
Llama 3.3 Version Release Date: December 6, 2024
“Agreement
7.6kB
Readme
- Quantization from
fp32
- Using i-matrix calibration dataset
calibration_datav3.txt
- I-Quants models, only if passing tests
iq2_xs, iq2_xxs, iq3_xxs, iq4_nl, iq3_s, iq2_s, iq4_xs, iq3_xs
- Default quantization
iq4_nl
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model.