Skip to content
arXiv cs.AI · Papers

Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

arXiv:2606.27483v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated strong capability in sequential decision-making, yet they remains fundamentally reactive in long-horizon tasks. Unlike humans who employ "what-if" reasoning to evaluate potential plans before commitment, standard agents