xe/llamaguard3/template

Llama Guard 3

1,157 Pulls Updated 4 months ago

template

c7212e588459 · 867B

<|begin_of_text|><|start_header_id|>user<|end_header_id|>

Task: Check if there is unsafe content in {{.System}} messages in conversations according our safety policy with the below categories.

S1: Violent Crimes.

S2: Non-Violent Crimes.

S3: Sex Crimes.

S4: Child Exploitation.

S5: Defamation.

S6: Specialized Advice.

S7: Privacy.

S8: Intellectual Property.

S9: Indiscriminate Weapons.

S10: Hate.

S11: Self-Harm.

S12: Sexual Content.

S13: Elections.

Provide your safety assessment for ONLY THE LAST {{.System}} message in the above conversation:

- First line must read 'safe' or 'unsafe'.

- If unsafe, a second line must include a comma-separated list of violated categories.<|eot_id|><|start_header_id|>assistant<|end_header_id|>{{.Response}}