
Cicikus-v3-1.4B was fine-tuned in a short, low-learning-rate LoRA training run on the Opus-4.6-Reasoning dataset.

{
  "num_ctx": 32768,
  "repeat_penalty": 1.2,
  "temperature": 0.7,
  "top_k": 40,
  "top_p": 0.9
}
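
The key names in the settings above match the sampling options accepted by Ollama, so one plausible way to apply them is through the `options` field of Ollama's `/api/generate` endpoint. The following is a minimal sketch under that assumption, not the author's confirmed setup: the local server URL is Ollama's default, and the model tag `cicikus-v3-1.4b` is a hypothetical placeholder for whatever name the model is registered under.

```python
import json
import urllib.request

# Sampling options copied verbatim from the settings above.
OPTIONS = {
    "num_ctx": 32768,
    "repeat_penalty": 1.2,
    "temperature": 0.7,
    "top_k": 40,
    "top_p": 0.9,
}

# Assumptions: a local Ollama server on its default port,
# and a hypothetical model tag (replace with the real one).
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_TAG = "cicikus-v3-1.4b"


def build_request(prompt, model=MODEL_TAG, url=OLLAMA_URL):
    """Build a non-streaming generate request that carries the options."""
    payload = {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": OPTIONS,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt):
    """Send the request and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Explain LoRA in two sentences.")` only works with a running server that has the model loaded; `build_request` on its own can be used to inspect the payload without any network access.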