1,449 1 year ago

GPT-SW3 is a collection of large decoder-only pretrained transformer language models that were developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language.

1.3b 6.7b 20b
57d856510dff · 47B
{
"num_gpu": 0,
"stop": [
"<s>",
"User:"
]
}