55 1 year ago

5.7B MQA base model of DeepSeek-Coder