63 2 months ago

reduced s5 default context to 4096 tokens to allow better local inference thinks in chinese somtimes??