Skip to content
X · @teortaxesTex · X / Twitter

«Gains come from models failing on different questions, not from adding more models.» Haven't read it yet but sounds right. Mixture of Models, where…

«Gains come from models failing on different questions, not from adding more models.»Haven't read it yet but sounds right. Mixture of Models, where all models are general-purpose competing LLMs, is a cope. Just train experts, do MOPD, and then do single-model test time scaling.Xiuyu Li: The paper argues that for any en