A suite of text embedding models by Snowflake, optimized for performance.

snowflake-arctic-embed is a suite of text embedding models focused on high-quality retrieval and optimized for performance.
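To make "retrieval" concrete, here is a minimal sketch of how embedding vectors are typically used to rank documents against a query. Cosine similarity is a common scoring choice, assumed here rather than taken from the model documentation. The function names and the toy 3-dimensional vectors are hypothetical stand-ins for real embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (document index, score) pairs, best match first."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for real embeddings (illustrative values only).
query = [1.0, 0.0, 0.0]
docs = [
    [0.0, 1.0, 0.0],  # orthogonal to the query
    [0.9, 0.1, 0.0],  # nearly parallel to the query
    [0.5, 0.5, 0.0],  # in between
]
ranking = rank_documents(query, docs)
```

In practice the query and document vectors would come from the same embedding model, since vectors from different models are not comparable.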

The models build on existing open-source text representation models, such as bert-base-uncased, and are trained in a multi-stage pipeline to optimize their retrieval performance.

The suite is available in five parameter sizes:

  • snowflake-arctic-embed:335m (default)
  • snowflake-arctic-embed:137m
  • snowflake-arctic-embed:110m
  • snowflake-arctic-embed:33m
  • snowflake-arctic-embed:22m
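As a rough illustration of selecting one of the tags above, the sketch below requests an embedding over HTTP. It assumes the models are served by a local Ollama server at its default address, that the server exposes the `/api/embed` endpoint, and that the chosen tag has already been pulled. None of this is stated in this page, so adjust it to your setup. The helper names `build_embed_request` and `embed` are hypothetical.

```python
import json
import urllib.request

# Assumption: a local Ollama server listening on its default port.
OLLAMA_URL = "http://localhost:11434/api/embed"


def build_embed_request(text, model="snowflake-arctic-embed:335m"):
    """Build the POST request that asks the server to embed `text`."""
    payload = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(text, model="snowflake-arctic-embed:335m"):
    """Send the request and return one embedding as a list of floats."""
    with urllib.request.urlopen(build_embed_request(text, model)) as resp:
        body = json.load(resp)
    # Assumption: the response carries a list of vectors under "embeddings".
    return body["embeddings"][0]


# Example call (requires a running server, so it is left commented out):
# vector = embed("What is a text embedding?", model="snowflake-arctic-embed:22m")
# print(len(vector))
```

Changing only the `model` argument switches between sizes. The smaller tags trade some retrieval quality for speed and memory, so it is worth testing more than one on your own data.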

Reference

Blog Post

HuggingFace