
-
taiwanllm-7b-v2.1-chat
Taiwan LLM is an advanced language model tailored for Traditional Chinese, focusing on the linguistic and cultural contexts of Taiwan.
482 Pulls 1 Tag Updated 12 months ago
-
llama3-8b-chinese-chat
Llama3-8B-Chinese-Chat is an instruction-tuned language model for Chinese and English users, with abilities such as role-playing and tool use, built upon the Meta-Llama-3-8B-Instruct model.
397 Pulls 1 Tag Updated 10 months ago
-
mistral-7b-v0.3-chinese
Mistral-7B-v0.3-Chinese is an instruction-tuned language model for Chinese and English users, with abilities such as role-playing and tool use, built upon Mistral-7B-Instruct-v0.3.
379 Pulls 1 Tag Updated 10 months ago
-
taiwanllm-13b-v2.0-chat
Taiwan LLM is an advanced language model tailored for Traditional Chinese, focusing on the linguistic and cultural contexts of Taiwan.
281 Pulls 1 Tag Updated 12 months ago
-
llama3-70b-chinese-chat
Llama3-70B-Chinese-Chat is an instruction-tuned language model for Chinese and English users, with abilities such as role-playing and tool use, built upon the Meta-Llama-3-70B-Instruct model.
193 Pulls 1 Tag Updated 10 months ago
-
openchat
The Llama-3-based OpenChat 3.6 (20240522), outperforming the official Llama 3 8B Instruct and open-source fine-tunes/merges.
171 Pulls 1 Tag Updated 10 months ago
-
sfr-iterative-dpo-llama-3-8b-r
SFR-Iterative-DPO-LLaMA-3-8B-R is a LLaMA-3-8B model further fine-tuned with SFT and RLHF, delivering strong performance. The model is from the Salesforce team.
170 Pulls 1 Tag Updated 10 months ago
-
faro-yi-9b-dpo
The Faro chat model focuses on practicality and long-context modeling. It handles various downstream tasks with higher quality, delivering stable and reliable results even when inputs contain lengthy documents or complex instructions.
144 Pulls 1 Tag Updated 9 months ago
-
mistral-7b-v0.3-chinese-chat
Mistral-7B-v0.3-Chinese-Chat is an instruction-tuned language model for Chinese and English users, with abilities such as role-playing and tool use, built upon mistralai/Mistral-7B-Instruct-v0.3.
125 Pulls 1 Tag Updated 9 months ago
-
aurora
Aurora is a Chinese MoE model refined from the Mixtral-8x7B architecture, unlocking the model's potential for bilingual Chinese-English dialogue across a wide range of open-domain topics.
73 Pulls 1 Tag Updated 13 months ago
-
minicpm2.6v