GPT-SW3 is a collection of large, decoder-only, pretrained transformer language models developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language.

5e843787bac9 · 50B
{ "num_batch": 16, "stop": [ "<s>", "User:" ] }
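
As a rough illustration of what the `stop` entries in these parameters imply: generation is expected to end at the first occurrence of any listed stop string, so text from `<s>` or `User:` onward is not returned. The sketch below mimics that behavior on plain strings. The helper name `truncate_at_stop` and the sample text are hypothetical and are not part of Ollama or GPT-SW3; the runtime applies stop sequences internally during decoding, and `num_batch` (assumed here to be the prompt-processing batch size) is only parsed, not used.

```python
import json

# Parameters copied verbatim from the listing above.
PARAMS = json.loads('{ "num_batch": 16, "stop": [ "<s>", "User:" ] }')


def truncate_at_stop(text: str, stops: list[str]) -> str:
    """Hypothetical helper: return text up to the earliest stop sequence.

    The stop sequence itself is excluded. If no stop sequence occurs,
    the text is returned unchanged.
    """
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    sample = "Hello there.\nUser: next question"
    # Everything from "User:" onward is dropped.
    print(repr(truncate_at_stop(sample, PARAMS["stop"])))
```

With these two stop strings, a chat-style transcript is cut before the model starts writing the next `User:` turn or emits a new `<s>` beginning-of-sequence marker.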