mannix/wizardlm2:7b-iq2

mannix/ wizardlm2:7b-iq2_xxs

108 Downloads Updated 2 years ago

State of the art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning and agent use cases. More quantizations.

ollama run mannix/wizardlm2:7b-iq2_xxs

curl http://localhost:11434/api/chat \
  -d '{
    "model": "mannix/wizardlm2:7b-iq2_xxs",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='mannix/wizardlm2:7b-iq2_xxs',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'mannix/wizardlm2:7b-iq2_xxs',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

2 years ago

371aa0cc1a4d · 2.0GB ·

model

archllama

parameters7.24B

quantizationIQ2_XXS

2.0GB

system

A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful,

154B

params

{ "num_ctx": 4096, "stop": [ "USER:", "ASSISTANT:" ] }

47B

template

{{ if .System }}{{ .System }} {{ end }}{{ if .Prompt }}USER: {{ .Prompt }} {{ end }}ASSISTANT: {{ .R

110B

Readme

WizardLM-2 is a next generation state-of-the-art large language model with improved performance on complex chat, multilingual, reasoning and agent use cases. This family includes three cutting-edge models:

wizardlm2:7b: fastest model, comparable performance with 10x larger open-source models. All quantizations are made with the i-matrix.
wizardlm2:8x22b: the most advanced model, and the best opensource LLM in Microsoft’s internal evaluation on highly complex tasks. Not using the i-matrix for now.

These are additionals quantizations from the official fp16 model: (wizardlm2)[https://ollama.com/library/wizardlm2]