GPT-SW3 is a collection of large decoder-only pretrained transformer language models that were developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language.

536 Pulls Updated 4 months ago

57d856510dff · 47B
{ "num_gpu": 0, "stop": [ "<s>", "User:" ] }