tazarov/all-minilm-l6-v2-f32

tazarov/ all-minilm-l6-v2-f32

11.6K Downloads Updated 1 year ago

all-MiniLM-L6-v2

embedding

ollama pull tazarov/all-minilm-l6-v2-f32

curl http://localhost:11434/api/embed \
  -d '{
    "model": "tazarov/all-minilm-l6-v2-f32",
    "input": "Why is the sky blue?"
  }'

import ollama

response = ollama.embed(
    model='tazarov/all-minilm-l6-v2-f32',
    input='The sky is blue because of Rayleigh scattering',
)
print(response.embeddings)

import ollama from 'ollama'

const response = await ollama.embed({
  model: 'tazarov/all-minilm-l6-v2-f32',
  input: 'The sky is blue because of Rayleigh scattering',
})
console.log(response.embeddings)

Models

View all →

Name

1 model

Size

Context

Input

all-minilm-l6-v2-f32:latest

91MB · 512 context window · Text · 1 year ago

all-minilm-l6-v2-f32:latest

91MB

512

Text

Readme

https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2

git lfs install
git clone https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2
git clone git@github.com:ggerganov/llama.cpp.git
cd llama.cpp/
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python convert-hf-to-gguf.py --outfile minilm.gguf --outtype f32 ../all-MiniLM-L6-v2/