596 1 year ago

Quantized 4-bit version of the original Chocolatine LLM, best performing 3B model on the OpenLLM Leaderboard.