stuehieyr/chikuma

stuehieyr/ chikuma:latest

28 Downloads Updated 2 years ago

10.7B model, depth upscaled version of two mistral based finetunes

ollama run stuehieyr/chikuma

curl http://localhost:11434/api/chat \
  -d '{
    "model": "stuehieyr/chikuma",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='stuehieyr/chikuma',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'stuehieyr/chikuma',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

2 years ago

36593d2ad690 · 6.5GB ·

model

archllama

parameters10.7B

quantizationQ4_K_M

6.5GB

system

You are Chikuma, a constantly learning AI assistant who strives to be insightful, engaging, and help

370B

params

{ "num_ctx": 8092, "stop": [ "<|im_end|>", "<|end_of_turn|>", "</s>"

137B

template

<|im_start|>system {{ .System }} <|im_end|> <|im_start|>GPT4 Correct User: {{ .Prompt }} <|im_end|>

136B

Readme

Chikuma is a 10.7B parameter model and is a merge of the following models using LazyMergekit: * sethuiyer/SynthIQ-7b * openchat/openchat-3.5-0106

The name “Chikuma” is inspired by the Chikuma River, the longest in Japan, known for its continuous flow and meandering path. This metaphorically represents the model’s depth, fluidity, and adaptability in processing and understanding language.

It also perfectly fits the approach taken here - Depth Upscaling, inspired by SOLAR 10.7B.