all-MiniLM-L6-v2
embedding
318 Pulls Updated 7 months ago
Updated 7 months ago
7 months ago
5828eca1915b · 91MB
model
archbert
·
parameters22.6M
·
quantizationF32
91MB
template
{{ .Prompt }}
13B
params
{"num_ctx":256}
16B
Readme
https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2
git lfs install
git clone https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2
git clone git@github.com:ggerganov/llama.cpp.git
cd llama.cpp/
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python convert-hf-to-gguf.py --outfile minilm.gguf --outtype f32 ../all-MiniLM-L6-v2/