Build Ollama For Reranker_v2 (LoRA BERT）

embedding

ollama pull AuditAid/Reranker_v2:Qwen3-Reranker-4B_Q5_K_M

curl http://localhost:11434/api/embed \
  -d '{
    "model": "AuditAid/Reranker_v2:Qwen3-Reranker-4B_Q5_K_M",
    "input": "Why is the sky blue?"
  }'

import ollama

response = ollama.embed(
    model='AuditAid/Reranker_v2:Qwen3-Reranker-4B_Q5_K_M',
    input='The sky is blue because of Rayleigh scattering',
)
print(response.embeddings)

import ollama from 'ollama'

const response = await ollama.embed({
  model: 'AuditAid/Reranker_v2:Qwen3-Reranker-4B_Q5_K_M',
  input: 'The sky is blue because of Rayleigh scattering',
})
console.log(response.embeddings)

Details

Updated 6 months ago

6 months ago

5e1a8097ab8f · 2.9GB ·

model

archqwen3

parameters4.02B

quantizationQ5_K_M

2.9GB

template

<|im_start|>system Judge whether the Document meets the requirements based on the Query and the Inst

326B

Readme

2025.12.29 update 更新

后续官方如有支持，则继续更新，目前推荐用llama.cpp部署

Build Ollama For Reranker_v2 (LoRA BERT）

Details

Readme

2025.12.29 update 更新

后续官方如有支持，则继续更新，目前推荐用llama.cpp部署

部署链接参考：https://github.com/AuditAIH/rerank_for_dify