State-of-the-art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning, and agent use cases. More quantizations.
52 Pulls Updated 7 months ago
371aa0cc1a4d · 2.0GB
model
arch llama · parameters 7.24B · quantization IQ2_XXS
2.0GB
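As a rough sanity check on the figures above, the file size and parameter count imply an average number of bits per weight. The sketch below assumes 1 GB = 10^9 bytes and treats the rounded 2.0GB as exact. The result is an average over the whole file, including metadata and any tensors stored at higher precision, so it is only an estimate.

```python
def bits_per_weight(file_size_bytes: float, n_params: float) -> float:
    """Average bits stored per parameter, counting the whole file."""
    return file_size_bytes * 8 / n_params


# Values copied from the listing; GB is assumed to mean 10**9 bytes.
FILE_SIZE_BYTES = 2.0e9
N_PARAMS = 7.24e9

bpw = bits_per_weight(FILE_SIZE_BYTES, N_PARAMS)
print(f"{bpw:.2f} bits per weight")
```

A value a little above 2 is in line with what the quantization name IQ2_XXS suggests.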
params
{
  "num_ctx": 4096,
  "stop": [
    "USER:",
    "ASSISTANT:"
  ]
}
47B
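These parameters can also be supplied per request. The following is a minimal sketch, assuming a local Ollama server and its `/api/generate` endpoint with an `options` object. The helper names are hypothetical, and the model tag passed in is a placeholder rather than a name taken from this page. The second helper only illustrates the effect of a stop sequence: output is cut at the first occurrence of any stop string.

```python
import json

# Parameters copied from the listing above.
PARAMS = {"num_ctx": 4096, "stop": ["USER:", "ASSISTANT:"]}


def build_generate_request(model: str, prompt: str) -> str:
    """Build a JSON body for POST /api/generate (hypothetical helper)."""
    body = {
        "model": model,      # placeholder tag chosen by the caller
        "prompt": prompt,
        "stream": False,     # ask for a single JSON response
        "options": PARAMS,   # per-request overrides of model parameters
    }
    return json.dumps(body)


def truncate_at_stop(text: str, stops: list[str]) -> str:
    """Cut text at the earliest occurrence of any stop sequence."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


print(build_generate_request("my-model-tag", "Why is the sky blue?"))
print(repr(truncate_at_stop("It scatters light. USER: thanks", PARAMS["stop"])))
```

Because both role markers are stop sequences, the model's reply ends before it can start writing the next turn of the conversation itself.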
template
{{ if .System }}{{ .System }} {{ end }}{{ if .Prompt }}USER: {{ .Prompt }} {{ end }}ASSISTANT: {{ .R
110B
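To see what the visible part of the template produces, here is a small Python sketch that mirrors it by hand. It covers only the portion shown above: the template is truncated in the listing, and the remainder is not reproduced here. `render_prompt` is a hypothetical helper, not part of Ollama.

```python
def render_prompt(system: str = "", prompt: str = "") -> str:
    """Mirror the visible portion of the template shown in the listing."""
    parts = []
    if system:                      # {{ if .System }}{{ .System }} {{ end }}
        parts.append(f"{system} ")
    if prompt:                      # {{ if .Prompt }}USER: {{ .Prompt }} {{ end }}
        parts.append(f"USER: {prompt} ")
    parts.append("ASSISTANT: ")     # always emitted; the model continues from here
    return "".join(parts)


print(repr(render_prompt("Be brief.", "Hi")))
```

The `USER:` and `ASSISTANT:` markers written by the template are the same strings listed as stop sequences in the params.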
system
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful,
154B
Readme
WizardLM-2 is a next-generation, state-of-the-art large language model with improved performance on complex chat, multilingual, reasoning, and agent use cases. This family includes three cutting-edge models:
wizardlm2:7b: fastest model, with performance comparable to open-source models 10x larger. All quantizations are made with the i-matrix.
wizardlm2:8x22b: the most advanced model, and the best open-source LLM in Microsoft’s internal evaluation on highly complex tasks. Not using the i-matrix for now.
These are additional quantizations of the official fp16 model: [wizardlm2](https://ollama.com/library/wizardlm2)