-
gemma-2-2b-jpn-it
Gemma-2-JPN is a Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language with the same level of performance of English only queries on Gemma 2.
660 Pulls 14 Tags Updated 2 months ago
-
mistral-nemo-minitron-8b-instruct
Mistral-NeMo-Minitron-8B-Instruct is a model for generating responses for various text-generation tasks including roleplaying, retrieval augmented generation, and function calling.
456 Pulls 14 Tags Updated 2 months ago
-
llama-3.1-swallow-8b-instruct-v0.1
Llama 3.1 Swallow is a series of large language models (8B, 70B) that were built by continual pre-training on the Meta Llama 3.1 models.
421 Pulls 13 Tags Updated 2 months ago
-
calm3-22b-chat
CyberAgentLM3 is a decoder-only language model pre-trained on 2.0 trillion tokens from scratch. CyberAgentLM3-Chat is a fine-tuned model specialized for dialogue use cases.
66 Pulls 14 Tags Updated 2 months ago
-
gemma-2-baku-2b-it
The model is an instruction-tuned variant of rinna/gemma-2-baku-2b, utilizing Chat Vector and Odds Ratio Preference Optimization (ORPO) for fine-tuning. It adheres to the gemma-2 chat format.
50 Pulls 14 Tags Updated 2 months ago