schroneko

schroneko

ぬこぬこ

https://note.com/schroneko/

gemma-2-2b-jpn-it

Gemma-2-JPN is a Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language with the same level of performance of English only queries on Gemma 2.

7,063 Pulls 14 Tags Updated 1 year ago
llama-3.1-swallow-8b-instruct-v0.1

Llama 3.1 Swallow is a series of large language models (8B, 70B) that were built by continual pre-training on the Meta Llama 3.1 models.

2,129 Pulls 13 Tags Updated 1 year ago
mistral-nemo-minitron-8b-instruct

Mistral-NeMo-Minitron-8B-Instruct is a model for generating responses for various text-generation tasks including roleplaying, retrieval augmented generation, and function calling.

1,118 Pulls 14 Tags Updated 1 year ago
calm3-22b-chat

CyberAgentLM3 is a decoder-only language model pre-trained on 2.0 trillion tokens from scratch. CyberAgentLM3-Chat is a fine-tuned model specialized for dialogue use cases.

799 Pulls 14 Tags Updated 1 year ago
gemma-2-baku-2b-it

The model is an instruction-tuned variant of rinna/gemma-2-baku-2b, utilizing Chat Vector and Odds Ratio Preference Optimization (ORPO) for fine-tuning. It adheres to the gemma-2 chat format.

279 Pulls 14 Tags Updated 1 year ago
smollm-135m

SmolLM-135M-GGUF quantized to Q4_0 GGUF for efficient inference.

60 Pulls 1 Tag Updated 4 months ago