-
stablelm2
Stable LM 2 is a state-of-the-art 1.6B and 12B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
1.6B 12B 54.3K Pulls 84 Tags Updated 2 months ago
-
granite-code
A family of open foundation models by IBM for Code Intelligence.
Code 3B 8B 52.4K Pulls 138 Tags Updated 6 weeks ago
-
all-minilm
Embedding models trained on very large sentence-level datasets.
Embedding 22M 33M 52.2K Pulls 10 Tags Updated 2 months ago
-
phind-codellama
Code generation model based on Code Llama.
Code 34B 49.8K Pulls 49 Tags Updated 7 months ago
-
dolphincoder
A 7B and 15B uncensored variant of the Dolphin model family that excels at coding, based on StarCoder2.
Code 7B 47.9K Pulls 35 Tags Updated 3 months ago
-
nous-hermes
General use models based on Llama and Llama 2 from Nous Research.
7B 13B 46.1K Pulls 63 Tags Updated 8 months ago
-
sqlcoder
SQLCoder is a code completion model fine-tuned on StarCoder for SQL generation tasks.
Code 7B 15B 70B 45.4K Pulls 48 Tags Updated 5 months ago
-
llama3-gradient
This model extends Llama-3 8B's context length from 8k to over 1m tokens.
8B 70B 43.7K Pulls 35 Tags Updated 2 months ago
-
starling-lm
Starling is a large language model trained by reinforcement learning from AI feedback focused on improving chatbot helpfulness.
7B 43K Pulls 36 Tags Updated 3 months ago
-
yarn-llama2
An extension of Llama 2 that supports a context of up to 128k tokens.
7B 13B 42.2K Pulls 67 Tags Updated 8 months ago
-
deepseek-llm
An advanced language model crafted with 2 trillion bilingual tokens.
7B 67B 42.2K Pulls 64 Tags Updated 7 months ago
-
xwinlm
Conversational model based on Llama 2 that performs competitively on various benchmarks.
7B 13B 42.1K Pulls 80 Tags Updated 8 months ago
-
llama3-chatqa
A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).
8B 70B 41.9K Pulls 35 Tags Updated 2 months ago
-
falcon Archive
A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chatbots.
7B 40B 180B 40.9K Pulls 38 Tags Updated 9 months ago
-
orca2
Orca 2 is built by Microsoft Research and is a fine-tuned version of Meta's Llama 2 models. The model is designed to excel particularly in reasoning.
7B 13B 39.8K Pulls 33 Tags Updated 8 months ago
-
wizardlm
General use model based on Llama 2.
7B 13B 30B 38.7K Pulls 73 Tags Updated 3 months ago
-
solar
A compact, yet powerful 10.7B large language model designed for single-turn conversation.
38.1K Pulls 32 Tags Updated 7 months ago
-
samantha-mistral
A companion assistant trained in philosophy, psychology, and personal relationships. Based on Mistral.
7B 36.1K Pulls 49 Tags Updated 9 months ago
-
dolphin-phi
2.7B uncensored Dolphin model by Eric Hartford, based on the Phi language model by Microsoft Research.
3B 33.3K Pulls 15 Tags Updated 7 months ago
-
stable-beluga
Llama 2-based model fine-tuned on an Orca-style dataset. Originally called Free Willy.
7B 13B 32.2K Pulls 49 Tags Updated 8 months ago
-
moondream
moondream2 is a small vision language model designed to run efficiently on edge devices.
Vision 31.2K Pulls 18 Tags Updated 2 months ago
-
bakllava
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
Vision 7B 29.9K Pulls 17 Tags Updated 7 months ago
-
wizardlm-uncensored
Uncensored version of the WizardLM model.
13B 29.1K Pulls 18 Tags Updated 9 months ago
-
snowflake-arctic-embed
A suite of text embedding models by Snowflake, optimized for performance.
Embedding 22M 33M 28.2K Pulls 16 Tags Updated 3 months ago
-
medllama2
Fine-tuned Llama 2 model to answer medical questions based on an open source medical dataset.
7B 26.6K Pulls 17 Tags Updated 9 months ago