OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.
2M Pulls 3 Tags Updated 3 weeks ago
A family of efficient AI models under 10B parameters performant in science, math, and coding through innovative training techniques.
489.1K Pulls 17 Tags Updated 8 months ago
Cohere For AI's language models trained to perform well across 23 different languages.
81.1K Pulls 33 Tags Updated 10 months ago
Codestral is Mistral AI’s first-ever code model designed for code generation tasks.
426.3K Pulls 17 Tags Updated 1 year ago
Merge of the Open Orca OpenChat model and the Garage-bAInd Platypus 2 model. Designed for chat and code generation.
34.1K Pulls 17 Tags Updated 1 year ago
A versatile model for AI software development scenarios, including code completion.
183.7K Pulls 17 Tags Updated 1 year ago
from https://github.com/ClosedCharacter/Peach
723 Pulls 1 Tag Updated 1 year ago
111 billion parameter model optimized for demanding enterprises that require fast, secure, and high-quality AI
53.9K Pulls 5 Tags Updated 5 months ago
An open large reasoning model for real-world solutions by the Alibaba International Digital Commerce Group (AIDC-AI).
50.2K Pulls 5 Tags Updated 9 months ago
Model made ideally for 1-on-1 roleplay, but one that is able to handle scenarios, RPGs and storywriting fine. One of more capable among 8b, uses reliable L3.
13.7K Pulls 1 Tag Updated 12 months ago
2,073 Pulls 1 Tag Updated 7 months ago
157 Pulls 1 Tag Updated 6 months ago
151 Pulls 1 Tag Updated 6 months ago
MiniCPM-V 2.6 is the latest and most capable model in the MiniCPM-V series. It exhibits a significant performance improvement over MiniCPM-Llama3-V 2.5
31.6K Pulls 1 Tag Updated 1 year ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
1,335 Pulls 9 Tags Updated 6 months ago
Ollama version for https://huggingface.co/mradermacher/Qwen2.5-7B-Instruct-abliterated-v2-GGUF
1,119 Pulls 1 Tag Updated 11 months ago
This model is used to translate modern Chinese into Classical Chinese. The dataset used is "The Seventy Biographies"(七十列传) from "Records of the Grand Historian"(史记).
40 Pulls 1 Tag Updated 11 months ago
This is open model for programming.
13 Pulls 1 Tag Updated 3 months ago
Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.
258K Pulls 5 Tags Updated 5 months ago
An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
66K Pulls 7 Tags Updated 12 months ago