
This is a distilled model trained on the latest TieBa dataset. Training used about 8k samples, with chains of thought generated by DeepSeek-V3.

741e943dbd4c · 248B
<|begin▁of▁sentence|>{{ if .System }}{{ .System }}{{ end }}{{ range .Messages }}{{ if eq .Role "user" }}<|User|>{{ .Content }}<|Assistant|>{{ else if eq .Role "assistant" }}{{ .Content }}<|end▁of▁sentence|>{{ end }}{{ end }}
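
To make the template above easier to follow, here is a minimal Python sketch of the string it appears to produce for a given conversation. The function name `render_prompt` and the message dictionary shape (`role`, `content`) are assumptions made for illustration; the actual rendering is done by the runtime's Go template engine, not by this code.

```python
# Special tokens copied verbatim from the template (note the full-width bars).
BOS = "<|begin▁of▁sentence|>"
EOS = "<|end▁of▁sentence|>"
USER_TAG = "<|User|>"
ASSISTANT_TAG = "<|Assistant|>"


def render_prompt(messages, system=""):
    """Approximate the template: hypothetical helper, not part of any library."""
    parts = [BOS]
    # {{ if .System }}{{ .System }}{{ end }}: an empty system prompt adds nothing.
    if system:
        parts.append(system)
    for msg in messages:
        if msg["role"] == "user":
            # A user turn is wrapped so the model continues as the assistant.
            parts.append(USER_TAG + msg["content"] + ASSISTANT_TAG)
        elif msg["role"] == "assistant":
            # A finished assistant turn is closed with the end-of-sentence token.
            parts.append(msg["content"] + EOS)
        # Any other role is skipped, as the template has no branch for it.
    return "".join(parts)


if __name__ == "__main__":
    history = [
        {"role": "user", "content": "Hello"},
        {"role": "assistant", "content": "Hi there"},
        {"role": "user", "content": "Tell me a joke"},
    ]
    print(render_prompt(history, system="You are a helpful assistant."))
```

Because the prompt ends with the assistant tag after the last user message, the model's generation picks up as the next assistant turn.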