rjmalagon/gte-qwen2-7b-instruct

rjmalagon/ gte-qwen2-7b-instruct

3,174 Downloads Updated 1 year ago

ollama run rjmalagon/gte-qwen2-7b-instruct:f16

curl http://localhost:11434/api/chat \
  -d '{
    "model": "rjmalagon/gte-qwen2-7b-instruct:f16",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='rjmalagon/gte-qwen2-7b-instruct:f16',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'rjmalagon/gte-qwen2-7b-instruct:f16',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Models

Name

2 models

Size / Usage

Context

Input

gte-qwen2-7b-instruct:f16

15GB · 128K context window · Text · 1 year ago

gte-qwen2-7b-instruct:f16

15GB

128K

Text

gte-qwen2-7b-instruct:bf16

15GB · 128K context window · Text · 1 year ago

gte-qwen2-7b-instruct:bf16

15GB

128K

Text

Readme

gte-Qwen2-7B-instruct is the latest model in the gte (General Text Embedding) model family that ranks No.1 in both English and Chinese evaluations on the Massive Text Embedding Benchmark MTEB benchmark (as of June 16, 2024).

Recently, the Qwen team released the Qwen2 series models, and we have trained the gte-Qwen2-7B-instruct model based on the Qwen2-7B LLM model. Compared to the gte-Qwen1.5-7B-instruct model, the gte-Qwen2-7B-instruct model uses the same training data and training strategies during the finetuning stage, with the only difference being the upgraded base model to Qwen2-7B. Considering the improvements in the Qwen2 series models compared to the Qwen1.5 series, we can also expect consistent performance enhancements in the embedding models.

The model incorporates several key advancements:

Integration of bidirectional attention mechanisms, enriching its contextual understanding.
Instruction tuning, applied solely on the query side for streamlined efficiency
Comprehensive training across a vast, multilingual text corpus spanning diverse domains and scenarios. This training leverages both weakly supervised and supervised data, ensuring the model's applicability across numerous languages and a wide array of downstream tasks.

Model Information

Model Size: 7B
Embedding Dimension: 3584
Max Input Tokens: 32k

https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct

gte-Qwen2-7B-instruct is the latest model in the gte (General Text Embedding) model family that ranks No.1 in both English and Chinese evaluations on the Massive Text Embedding Benchmark MTEB benchmark (as of June 16, 2024).

Recently, the Qwen team released the Qwen2 series models, and we have trained the gte-Qwen2-7B-instruct model based on the Qwen2-7B LLM model. Compared to the gte-Qwen1.5-7B-instruct model, the gte-Qwen2-7B-instruct model uses the same training data and training strategies during the finetuning stage, with the only difference being the upgraded base model to Qwen2-7B. Considering the improvements in the Qwen2 series models compared to the Qwen1.5 series, we can also expect consistent performance enhancements in the embedding models.

The model incorporates several key advancements:

Integration of bidirectional attention mechanisms, enriching its contextual understanding.
    Instruction tuning, applied solely on the query side for streamlined efficiency
    Comprehensive training across a vast, multilingual text corpus spanning diverse domains and scenarios. This training leverages both weakly supervised and supervised data, ensuring the model's applicability across numerous languages and a wide array of downstream tasks.

Model Information

Model Size: 7B
    Embedding Dimension: 3584
    Max Input Tokens: 32k

[https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)