40 2 years ago

Fine-tuned version of llama2-v0.1-instruct from BanglaLLM in huggingface. Quantized to 4bit -> q4_k_m using llama.cpp.

ollama run kaizu/bn_chat

Models

View all →

1 model

bn_chat:latest

4.2GB · 4K context window · Text · 2 years ago

Readme

No readme