279 8 months ago

Quantized 4-bit version of the original Chocolatine LLM, best performing 13B model on the OpenLLM Leaderboard.