llama.cpp releases
· Infrastructure
b9870
chat: trim messages sent to StepFun parser (fixes long reasoning loops) (#25238) chat: trim messages sent to StepFun parser (fixes long reasoning loops) add regression test; remove duplicate template chat: trim StepFun content parts before rendering The StepFun trim workaround ran on the already-rendered messages, wher