shinyzhu/yayi

241 Downloads · Updated 2 years ago

Yet Another Yi-9B Model (User Converted)

9b

```shell
ollama run shinyzhu/yayi
```

```shell
curl http://localhost:11434/api/chat \
  -d '{
    "model": "shinyzhu/yayi",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```

```python
from ollama import chat

response = chat(
    model='shinyzhu/yayi',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)
```

```javascript
import ollama from 'ollama'

const response = await ollama.chat({
  model: 'shinyzhu/yayi',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)
```
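Where the `ollama` package is not installed, the same request can be sketched with only the Python standard library. This is a minimal sketch, not part of the original page: the helper names `build_chat_request` and `send` are hypothetical, and it assumes a local Ollama server at the default address used in the curl example. Setting `"stream": false` asks the server for a single JSON object instead of a stream of chunks.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/chat"  # same endpoint as the curl example


def build_chat_request(prompt, model="shinyzhu/yayi"):
    """Build (but do not send) a non-streaming chat request. Hypothetical helper."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one JSON object instead of streamed chunks
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request; needs a running Ollama server with the model pulled."""
    with urllib.request.urlopen(request) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    return body["message"]["content"]
```

Calling `send(build_chat_request("Hello!"))` should return the assistant's reply as a string once the server is running and the model has been pulled.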

Models

4 models

| Name | Size | Context | Input | Updated |
| --- | --- | --- | --- | --- |
| yayi:latest | 5.0GB | 4K | Text | 2 years ago |
| yayi:9b | 5.0GB | 4K | Text | 2 years ago |
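The 4K context window listed for these tags bounds how much prompt plus output the model can attend to. As a rough sketch (not from the original page), the snippet below passes `num_ctx` through the `options` field of a chat request and uses a crude characters-per-token heuristic to flag prompts that are likely too long. The value 4096 for "4K", the ratio of four characters per token, the 512-token output reserve, and the helper names are all assumptions; real token counts depend on the tokenizer.

```python
NUM_CTX = 4096        # assumed meaning of the "4K" context window
CHARS_PER_TOKEN = 4   # rough heuristic, not the real tokenizer


def estimate_tokens(text):
    """Very rough token estimate based on character count."""
    return max(1, len(text) // CHARS_PER_TOKEN)


def chat_payload(prompt, reserve_for_output=512):
    """Build a chat body, refusing prompts that likely overflow the window."""
    budget = NUM_CTX - reserve_for_output
    if estimate_tokens(prompt) > budget:
        raise ValueError("prompt is probably too long for a 4K context")
    return {
        "model": "shinyzhu/yayi",
        "messages": [{"role": "user", "content": prompt}],
        "options": {"num_ctx": NUM_CTX},  # request the full assumed window
    }
```

The returned dictionary can be sent as the JSON body of a `POST` to `/api/chat`, or its `options` entry can be passed as the `options` argument of `ollama.chat`.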

Readme

# Yet Another Yi-9B Model (User Converted)

## Introduction

The [official Yi page](https://ollama.com/library/yi) on the Ollama library hasn't published the Yi-9B model yet, so I followed the [Ollama doc](https://github.com/ollama/ollama/blob/main/docs/import.md) and converted one myself.

Feel free to try it out and send me your feedback.

Please also [check out my blog post about the building process](https://shinyzhu.com/posts/2024/importing-yi9b-to-ollama/).

Note: **Yi-9B is a base model**. I haven't applied any fine-tuning, but I'd like to. Please send your suggestions as well.
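Because a base model continues text rather than following chat turns, completion-style prompts with a few worked examples tend to behave better than bare instructions. The sketch below is an illustration, not from the original README: `few_shot_prompt` and `build_generate_payload` are hypothetical helpers, the English/French pattern is made up, and it assumes the `9b` tag serves the base Yi-9B weights. It targets Ollama's `/api/generate` endpoint with `raw` set so that no prompt template is applied.

```python
def few_shot_prompt(examples, query):
    """Format (input, output) pairs as a text pattern for the model to continue."""
    lines = [f"English: {src}\nFrench: {dst}\n" for src, dst in examples]
    lines.append(f"English: {query}\nFrench:")
    return "\n".join(lines)


def build_generate_payload(prompt, model="shinyzhu/yayi:9b"):
    """Body for POST /api/generate; assumes the 9b tag is the base model."""
    return {
        "model": model,
        "prompt": prompt,
        "raw": True,       # send the prompt as-is, without a chat template
        "stream": False,   # one JSON object instead of streamed chunks
        "options": {"stop": ["\n"]},  # stop once the current line is completed
    }
```

For example, `build_generate_payload(few_shot_prompt([("Hello", "Bonjour")], "Thank you"))` yields a body whose prompt ends with `French:`, leaving the model to fill in the rest of that line.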

Note 2: Yi-1.5-9B-Chat is now available.

## Changelog

- **2024-05-13**: `latest` was updated to [Yi-1.5-9B-Chat](https://huggingface.co/01-ai/Yi-1.5-9B-Chat). You can also check out the `1.5-9b-chat` tag.
- **2024-04-18**: Yi-9B was updated to [the latest commit](https://github.com/01-ai/Yi/commit/731b2af8583cba38d6544ebf909d7c85545f75a8), and Yi-9B-200K is ready for download.
- **2024-04-07**: Created. See [this blog post](https://shinyzhu.com/posts/2024/importing-yi9b-to-ollama/).
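Since `latest` has already been repointed once, scripts that depend on a particular model may prefer to pin an explicit tag. The helper below is a small sketch under that assumption. `model_ref` is a hypothetical name, and the tag set contains only the tags named on this page, so it is not a complete list.

```python
REPO = "shinyzhu/yayi"
# Only the tags named on this page; the repository may offer others.
KNOWN_TAGS = {"latest", "9b", "1.5-9b-chat"}


def model_ref(tag="latest"):
    """Return a fully qualified model name such as 'shinyzhu/yayi:9b'."""
    if tag not in KNOWN_TAGS:
        raise ValueError(f"unknown tag: {tag!r}")
    return f"{REPO}:{tag}"
```

The resulting string can be used anywhere a model name is expected, for instance as the `model` argument of `ollama.chat` or after `ollama run` on the command line.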

## About Yi-9B

> 🎯 2024-03-16: The Yi-9B-200K is open-sourced and available to the public.
> 
> 🎯 2024-03-06: The Yi-9B is open-sourced and available to the public.
> 
> Yi-9B stands out as the top performer among a range of similar-sized open-source models (including Mistral-7B, SOLAR-10.7B, Gemma-7B, DeepSeek-Coder-7B-Base-v1.5 and more), particularly excelling in code, math, common-sense reasoning, and reading comprehension.

ref: <https://huggingface.co/01-ai/Yi-9B#yi-9b>

Yi-9B is almost the best among a range of similar-sized open-source models (including Mistral-7B, SOLAR-10.7B, Gemma-7B, DeepSeek-Coder-7B-Base-v1.5 and more), particularly excelling in code, math, common-sense reasoning, and reading comprehension.

![](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_details.png?raw=true)

In terms of overall ability (Mean-All), Yi-9B performs the best among similarly sized open-source models, surpassing DeepSeek-Coder, DeepSeek-Math, Mistral-7B, SOLAR-10.7B, and Gemma-7B.

![](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_overall.png?raw=true)

In terms of coding ability (Mean-Code), Yi-9B's performance is second only to DeepSeek-Coder-7B, surpassing Yi-34B, SOLAR-10.7B, Mistral-7B, and Gemma-7B.

![](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_code.png?raw=true)

In terms of math ability (Mean-Math), Yi-9B's performance is second only to DeepSeek-Math-7B, surpassing SOLAR-10.7B, Mistral-7B, and Gemma-7B.

![](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_math.png?raw=true)

In terms of common sense and reasoning ability (Mean-Text), Yi-9B's performance is on par with Mistral-7B, SOLAR-10.7B, and Gemma-7B.

![](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_text.png?raw=true)

## Feedback

Please feel free to [follow me on X](https://twitter.com/shinyzhu) and discuss this model.
