Yet Another Yi-9B Model (User Converted)

111 Pulls Updated 3 days ago

Readme

Yet Another Yi-9B Model (User Converted)

Introduction

Because Yi official hasn’t publish the Yi-9B model on Ollama library yet. So I followed the Ollama doc and converted one.

Feel free to try it out and send me your feedbacks.

Please also check out my blog post about the building process.

Note: Yi-9B is a base model, I didn’t apply any fine tune but I’d like to. Please send your suggestions as well.

Note 2: Yi-1.5-9B-Chat is now available.

Changelog

About Yi-9B

🎯 2024-03-16: The Yi-9B-200K is open-sourced and available to the public.

🎯 2024-03-06: The Yi-9B is open-sourced and available to the public.

Yi-9B stands out as the top performer among a range of similar-sized open-source models (including Mistral-7B, SOLAR-10.7B, Gemma-7B, DeepSeek-Coder-7B-Base-v1.5 and more), particularly excelling in code, math, common-sense reasoning, and reading comprehension.

ref: https://huggingface.co/01-ai/Yi-9B#yi-9b

Yi-9B is almost the best among a range of similar-sized open-source models (including Mistral-7B, SOLAR-10.7B, Gemma-7B, DeepSeek-Coder-7B-Base-v1.5 and more), particularly excelling in code, math, common-sense reasoning, and reading comprehension.

In terms of overall ability (Mean-All), Yi-9B performs the best among similarly sized open-source models, surpassing DeepSeek-Coder, DeepSeek-Math, Mistral-7B, SOLAR-10.7B, and Gemma-7B.

In terms of coding ability (Mean-Code), Yi-9B’s performance is second only to DeepSeek-Coder-7B, surpassing Yi-34B, SOLAR-10.7B, Mistral-7B, and Gemma-7B.

In terms of math ability (Mean-Math), Yi-9B’s performance is second only to DeepSeek-Math-7B, surpassing SOLAR-10.7B, Mistral-7B, and Gemma-7B.

In terms of common sense and reasoning ability (Mean-Text), Yi-9B’s performance is on par with Mistral-7B, SOLAR-10.7B, and Gemma-7B.

Feedback

Please feel free to follow me on X and discuss about this model.