GPT-SW3 is a collection of large decoder-only pretrained transformer language models that were developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language.

536 Pulls Updated 4 months ago

a173c2b11c05 · 35B
{ "stop": [ "<s>", "User:" ] }