Orca 2 is built by Microsoft Research and is a fine-tuned version of Meta's Llama 2 models. The model is designed to excel particularly in reasoning.
80.6K Pulls 33 Tags Updated 1 year ago
The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.
3M Pulls 36 Tags Updated 1 year ago
A very small test: gemma3:1b fine-tuned on a dataset obsessed with spiders. As a result, this model puts spiders in all its answers. It has no practical use: it is just a pet project for learning to generate a dataset and fine-tune a small model.
39 Pulls 1 Tag Updated 5 months ago
Ollama models of DeepSeek Janus Pro 7B
5,306 Pulls 11 Tags Updated 8 months ago
*Yip! Yip-yip!* https://github.com/mcandre/ollama-models
1 Tag Updated 6 months ago
LLaMA 3.1 8B Instruct model fine-tuned for advanced Wazuh security log analysis with instruction-following capabilities.
8 Pulls 1 Tag Updated 1 week ago
LLaMA 3.1 8B Instruct model fine-tuned for advanced Wazuh security log analysis with instruction-following capabilities
6 Pulls 1 Tag Updated 1 week ago
MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and supports multimodal conversation in over 30 languages.
44.7K Pulls 8 Tags Updated 1 year ago
A model made primarily for 1-on-1 roleplay that also handles scenarios, RPGs and story writing well. One of the more capable 8B models; uses the reliable L3.
14K Pulls 1 Tag Updated 1 year ago
The official ollama model for Gemma-2-9B-Chinese-Chat (https://huggingface.co/shenzhi-wang/Gemma-2-9B-Chinese-Chat).
4,606 Pulls 1 Tag Updated 1 year ago
The ollama model for the f16 GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-f16).
4,315 Pulls 2 Tags Updated 1 year ago
The official ollama model for Gemma-2-27B-Chinese-Chat (https://huggingface.co/shenzhi-wang/Gemma-2-27B-Chinese-Chat).
1,578 Pulls 1 Tag Updated 1 year ago
Stheno-v3.2-Zeta: I did test runs with multiple variations of the model, merged back to its base at various weights and with different training runs, and this sixth iteration is the one I like most.
567 Pulls 1 Tag Updated 1 year ago
New state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. I-Quants models.
284 Pulls 9 Tags Updated 9 months ago
LLaMA 3.1 8B Instruct model fine-tuned for AWS cloud security event analysis.
4 Pulls 1 Tag Updated 1 week ago
The ollama model for the 8bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit).
15.6K Pulls 2 Tags Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).
7,466 Pulls 2 Tags Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).
2,752 Pulls 1 Tag Updated 1 year ago
The ollama model for the 8bit-quantized GGUF version of llama3-70b-chinese-chat.
1,959 Pulls 1 Tag Updated 1 year ago
A quick and dirty ollama model made from https://huggingface.co/lmstudio-community/Phi-3.5-mini-instruct-GGUF
244 Pulls 1 Tag Updated 1 year ago