gte-Qwen2-1.5B-instruct
gte-Qwen2-1.5B-instruct is the latest model in the gte (General Text Embedding) model family. It is built on the Qwen2-1.5B LLM and uses the same training data and strategies as the gte-Qwen2-7B-instruct model.
The model incorporates several key advancements:
- Integration of bidirectional attention mechanisms, enriching its contextual understanding.
- Instruction tuning, applied solely on the query side for streamlined efficiency (see the query-formatting sketch after this list).
- Comprehensive training across a vast, multilingual text corpus spanning diverse domains and scenarios, leveraging both weakly supervised and supervised data to ensure the model's applicability across numerous languages and a wide array of downstream tasks.
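Because instruction tuning is applied only on the query side, queries are prefixed with a task instruction while documents are embedded as-is. A minimal sketch of the Instruct/Query convention used by the upstream gte-Qwen2 model cards (the task description and example texts are illustrative):

```python
# Query-side instruction formatting, following the "Instruct: ...\nQuery: ..."
# convention from the upstream gte-Qwen2 model cards. The task description
# below is an example; adapt it to your retrieval task.
def get_detailed_instruct(task_description: str, query: str) -> str:
    return f"Instruct: {task_description}\nQuery: {query}"

task = "Given a web search query, retrieve relevant passages that answer the query"
query = get_detailed_instruct(task, "how much protein should a female eat")

# Documents are embedded without any instruction prefix.
document = "As a general guideline, the CDC's average protein requirement for women ages 19 to 70 is 46 grams per day."
```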
Model Information
- Model Size: 1.5B
- Embedding Dimension: 1536
- Max Input Tokens: 32k
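As a quick sanity check of the embedding dimension, the Ollama Python client can be used as sketched below. The local model tag is an assumption; adjust it to whatever name you pulled the model under:

```python
# Verify the 1536-dimensional output via the Ollama Python client
# (pip install ollama). Assumes the model is pulled locally; the tag
# "gte-qwen2-1.5b-instruct" is an example and may differ on your system.
import ollama

resp = ollama.embeddings(model="gte-qwen2-1.5b-instruct",
                         prompt="hello world")
print(len(resp["embedding"]))  # expected: 1536
```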
Evaluation
MTEB & C-MTEB
You can use scripts/eval_mteb.py to reproduce the following results for gte-Qwen2-1.5B-instruct on MTEB (English) and C-MTEB (Chinese); an illustrative evaluation sketch follows the table:
| Model Name | MTEB (56) | C-MTEB (35) | MTEB-fr (26) | MTEB-pl (26) |
|---|---|---|---|---|
| bge-base-en-v1.5 | 64.23 | - | - | - |
| bge-large-en-v1.5 | 63.55 | - | - | - |
| gte-large-en-v1.5 | 65.39 | - | - | - |
| gte-base-en-v1.5 | 64.11 | - | - | - |
| mxbai-embed-large-v1 | 64.68 | - | - | - |
| acge_text_embedding | - | 69.07 | - | - |
| stella-mrl-large-zh-v3.5-1792d | - | 68.55 | - | - |
| gte-large-zh | - | 66.72 | - | - |
| multilingual-e5-base | 59.45 | 56.21 | - | - |
| multilingual-e5-large | 61.50 | 58.81 | - | - |
| e5-mistral-7b-instruct | 66.63 | 60.81 | - | - |
| gte-Qwen1.5-7B-instruct | 67.34 | 69.52 | - | - |
| NV-Embed-v1 | 69.32 | - | - | - |
| gte-Qwen2-7B-instruct | 70.24 | 72.05 | 68.25 | 67.86 |
| gte-Qwen2-1.5B-instruct | 67.16 | 67.65 | 66.60 | 64.04 |
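scripts/eval_mteb.py is the canonical way to reproduce the numbers above. Purely as an illustration of the evaluation loop, the sketch below wires an Ollama-backed encoder into the open-source mteb package; the wrapper class, task choice, and model tag are assumptions (not part of this repository), and the mteb API may differ across versions:

```python
# Illustrative sketch: evaluate an Ollama-served embedding model on one
# MTEB task with the `mteb` package (pip install mteb ollama numpy).
# Use scripts/eval_mteb.py to reproduce the published numbers.
import numpy as np
import ollama
from mteb import MTEB


class OllamaEncoder:
    """Minimal encoder wrapper; MTEB only requires an encode() method."""

    def __init__(self, model: str):
        self.model = model

    def encode(self, sentences, **kwargs):
        # MTEB expects an (n_sentences, dim) array from encode().
        embs = [ollama.embeddings(model=self.model, prompt=s)["embedding"]
                for s in sentences]
        return np.asarray(embs)


evaluation = MTEB(tasks=["Banking77Classification"])
evaluation.run(OllamaEncoder("gte-qwen2-1.5b-instruct"),
               output_folder="results")
```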
GTE Models
The gte series has consistently released two types of models: encoder-only models (based on the BERT architecture) and decoder-only models (based on the LLM architecture).
| Models | Language | Max Sequence Length | Dimension | Model Size (Memory Usage, fp32) |
|---|---|---|---|---|
| GTE-large-zh | Chinese | 512 | 1024 | 1.25GB |
| GTE-base-zh | Chinese | 512 | 512 | 0.41GB |
| GTE-small-zh | Chinese | 512 | 512 | 0.12GB |
| GTE-large | English | 512 | 1024 | 1.25GB |
| GTE-base | English | 512 | 512 | 0.21GB |
| GTE-small | English | 512 | 384 | 0.10GB |
| GTE-large-en-v1.5 | English | 8192 | 1024 | 1.74GB |
| GTE-base-en-v1.5 | English | 8192 | 768 | 0.51GB |
| GTE-Qwen1.5-7B-instruct | Multilingual | 32000 | 4096 | 26.45GB |
| GTE-Qwen2-7B-instruct | Multilingual | 32000 | 3584 | 26.45GB |
| GTE-Qwen2-1.5B-instruct | Multilingual | 32000 | 1536 | 6.62GB |
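For a typical retrieval use of these models, the sketch below embeds an instructed query and plain documents, then ranks documents by cosine similarity; the model tag, task text, and example documents are illustrative:

```python
# Minimal retrieval sketch: embed an instructed query and plain documents,
# then rank by cosine similarity. Model tag and texts are assumptions.
import numpy as np
import ollama

MODEL = "gte-qwen2-1.5b-instruct"  # adjust to your local tag


def embed(text: str) -> np.ndarray:
    return np.asarray(ollama.embeddings(model=MODEL, prompt=text)["embedding"])


task = "Given a web search query, retrieve relevant passages that answer the query"
query_vec = embed(f"Instruct: {task}\nQuery: what is a text embedding?")

docs = [
    "A text embedding maps a piece of text to a dense numeric vector.",
    "The capital of France is Paris.",
]
doc_vecs = np.stack([embed(d) for d in docs])

# Cosine similarity: dot product of L2-normalized vectors.
scores = doc_vecs @ query_vec / (
    np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec))
print(sorted(zip(scores, docs), reverse=True)[0][1])  # best-matching document
```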
Citation
If you find our paper or models helpful, please consider citing: