Skip to content
arXiv cs.CV · Papers

SC3-Eval: Evaluating Robot Foundation Models via Self-Consistent Video Generation

arXiv:2606.18610v2 Announce Type: replace-cross Abstract: Evaluating generalist robot manipulation policies in the real world is expensive, slow, and difficult to scale. Action-conditioned video world models offer a scalable alternative by simulating policy rollouts. Autoregressive rollouts accumulate compounding error