Skip to content
llama.cpp releases · Infrastructure

b9870

chat: trim messages sent to StepFun parser (fixes long reasoning loops) (#25238) chat: trim messages sent to StepFun parser (fixes long reasoning loops) add regression test; remove duplicate template chat: trim StepFun content parts before rendering The StepFun trim workaround ran on the already-rendered messages, wher