BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.
embedding
567m
43.8K Pulls · Updated 3 months ago
790764642607 · 1.2GB
model · arch bert · parameters 567M · quantization F16 · 1.2GB
license · MIT License · 1.1kB
Readme
BGE-M3 is based on the XLM-RoBERTa architecture and is distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity:
- Multi-Functionality: It can simultaneously perform the three common retrieval functionalities of embedding models: dense retrieval, multi-vector retrieval, and sparse retrieval (a dense-retrieval sketch follows this list).
- Multi-Linguality: It supports more than 100 working languages.
- Multi-Granularity: It can process inputs of different granularities, from short sentences to long documents of up to 8192 tokens.
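As a concrete illustration of the dense-retrieval functionality, here is a minimal sketch that embeds a query and two documents and ranks the documents by cosine similarity. It assumes a local Ollama server on the default port with the model already pulled (`ollama pull bge-m3`); the endpoint and field names follow Ollama's `/api/embed` embeddings API, and the example texts are made up for illustration.

```python
# Minimal dense-retrieval sketch against a local Ollama server.
# Assumes `ollama pull bge-m3` has been run and the server is listening
# on the default port 11434.
import requests

def embed(texts):
    # /api/embed returns one embedding vector per input string.
    resp = requests.post(
        "http://localhost:11434/api/embed",
        json={"model": "bge-m3", "input": texts},
    )
    resp.raise_for_status()
    return resp.json()["embeddings"]

def cosine(a, b):
    # Cosine similarity between two dense vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sum(x * x for x in a) ** 0.5
    norm_b = sum(x * x for x in b) ** 0.5
    return dot / (norm_a * norm_b)

query = "What do giant pandas eat?"
docs = [
    "Giant pandas feed almost exclusively on bamboo.",
    "Paris is the capital and largest city of France.",
]
vectors = embed([query] + docs)
ranked = sorted(
    ((cosine(vectors[0], v), doc) for v, doc in zip(vectors[1:], docs)),
    reverse=True,
)
for score, doc in ranked:
    print(f"{score:.3f}  {doc}")
```

Note that this API exposes only the dense embeddings; the sparse (lexical) and multi-vector (ColBERT-style) outputs are typically accessed through the FlagEmbedding library instead.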
Benchmarks from the open-source community
@misc{bge-m3,
  title={BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation},
  author={Jianlv Chen and Shitao Xiao and Peitian Zhang and Kun Luo and Defu Lian and Zheng Liu},
  year={2024},
  eprint={2402.03216},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}