
AMD-Llama-135m is a language model trained on AMD MI250 GPUs.

887b7f8373ee · 45B
{
  "num_ctx": 2048,
  "stop": [
    "'</s>'"
  ]
}
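As a rough illustration of how these parameters might be applied, the sketch below parses the JSON above and places it in the `options` field of a request body for Ollama's `/api/generate` endpoint. The model tag `amd-llama-135m` and the helper name `build_generate_payload` are assumptions made for this example and are not taken from the listing. The sketch only builds the payload and does not send it.

```python
import json

# Parameters copied verbatim from the listing above.
PARAMS_JSON = """
{
  "num_ctx": 2048,
  "stop": ["'</s>'"]
}
"""


def build_generate_payload(prompt, model="amd-llama-135m"):
    """Build a request body for POST /api/generate.

    The default model tag is hypothetical; substitute the tag under
    which the model is actually installed.
    """
    options = json.loads(PARAMS_JSON)
    return {
        "model": model,      # assumed tag, not confirmed by the listing
        "prompt": prompt,
        "stream": False,     # request a single JSON response
        "options": options,  # num_ctx and stop from the params above
    }


payload = build_generate_payload("Hello")
print(json.dumps(payload, indent=2))
```

Note that the stop string as listed includes the single quotes, so it matches the text `'</s>'` rather than a bare `</s>`. The listing does not say whether that is intentional.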