A high-performing open embedding model with a large token context window.
20.7M Pulls 3 Tags Updated 13 months ago
State-of-the-art large embedding model from mixedbread.ai
2M Pulls 4 Tags Updated 10 months ago
A suite of text embedding models by Snowflake, optimized for performance.
702.9K Pulls 16 Tags Updated 11 months ago
Embedding models on very large sentence level datasets.
351K Pulls 10 Tags Updated 10 months ago
Embedding model from BAAI mapping texts to vectors.
94.3K Pulls 3 Tags Updated 7 months ago
Snowflake's frontier embedding model. Arctic Embed 2.0 adds multilingual support without sacrificing English performance or scalability.
53.2K Pulls 3 Tags Updated 3 months ago
The IBM Granite Embedding 30M and 278M models models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.
27.1K Pulls 6 Tags Updated 3 months ago
EXAONE 3.5 is a collection of instruction-tuned bilingual (English and Korean) generative models ranging from 2.4B to 32B parameters, developed and released by LG AI Research.
33.4K Pulls 13 Tags Updated 3 months ago
multilingual e5 large instruct embedding model from intfloat
368.4K Pulls 3 Tags Updated 7 weeks ago
https://huggingface.co/DMetaSoul/Dmeta-embedding-zh
151.5K Pulls 1 Tag Updated 13 days ago
(Updated: 07/1/2024) https://huggingface.co/jinaai/jina-embeddings-v2-base-code
73.4K Pulls 15 Tags Updated 9 months ago
Text embedding model (base) for English and German input of size up to 8192 tokens
73K Pulls 1 Tag Updated 10 months ago
64.3K Pulls 1 Tag Updated 12 months ago
An embedding model created by Salesforce Research that you can use for semantic search. Currently the best open source embedding model on MTEB.
25.7K Pulls 4 Tags Updated 11 months ago
BAAI General Embedding
25.4K Pulls 16 Tags Updated 13 months ago
Text embedding model (base) for input of size up to 8192 tokens
8,027 Pulls 1 Tag Updated 10 months ago
6,421 Pulls 1 Tag Updated 9 months ago
Moka-AI Massive Mixed Embedding
5,577 Pulls 7 Tags Updated 12 months ago
A cross-domain, cross-task, out-of-the-box Chinese embedding model.
4,460 Pulls 1 Tag Updated 11 months ago
Text embedding model (base) for English and Spanish input of size up to 8192 tokens
4,036 Pulls 1 Tag Updated 10 months ago