State of the art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning and agent use cases. More quantizations.
74 Pulls Updated 12 months ago
Updated 12 months ago
12 months ago
59c096c63c6e · 2.2GB
model
archllama
·
parameters7.24B
·
quantizationIQ2_XS
2.2GB
params
{
"num_ctx": 4096,
"stop": [
"USER:",
"ASSISTANT:"
]
}
47B
template
{{ if .System }}{{ .System }} {{ end }}{{ if .Prompt }}USER: {{ .Prompt }} {{ end }}ASSISTANT: {{ .R
110B
system
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful,
154B
Readme
WizardLM-2 is a next generation state-of-the-art large language model with improved performance on complex chat, multilingual, reasoning and agent use cases. This family includes three cutting-edge models:
wizardlm2:7b
: fastest model, comparable performance with 10x larger open-source models. All quantizations are made with the i-matrix.wizardlm2:8x22b
: the most advanced model, and the best opensource LLM in Microsoft’s internal evaluation on highly complex tasks. Not using the i-matrix for now.
These are additionals quantizations from the official fp16 model: (wizardlm2)[https://ollama.com/library/wizardlm2]