 
            
- gemma-2-2b-jpn-it
  Gemma-2-JPN is a Gemma 2 2B model fine-tuned on Japanese text. It handles Japanese queries at the same level of performance as English-only queries on Gemma 2.
  5,601 Pulls · 14 Tags · Updated 1 year ago
- llama-3.1-swallow-8b-instruct-v0.1
  Llama 3.1 Swallow is a series of large language models (8B, 70B) that were built by continual pre-training on the Meta Llama 3.1 models.
  1,172 Pulls · 13 Tags · Updated 1 year ago
- mistral-nemo-minitron-8b-instruct
  Mistral-NeMo-Minitron-8B-Instruct is a model for generating responses for various text-generation tasks, including roleplaying, retrieval-augmented generation, and function calling.
  772 Pulls · 14 Tags · Updated 1 year ago
- calm3-22b-chat
  CyberAgentLM3 is a decoder-only language model pre-trained from scratch on 2.0 trillion tokens. CyberAgentLM3-Chat is a fine-tuned model specialized for dialogue use cases.
  333 Pulls · 14 Tags · Updated 1 year ago
- gemma-2-baku-2b-it
  The model is an instruction-tuned variant of rinna/gemma-2-baku-2b, utilizing Chat Vector and Odds Ratio Preference Optimization (ORPO) for fine-tuning. It adheres to the gemma-2 chat format.
  236 Pulls · 14 Tags · Updated 1 year ago