arXiv cs.CV
· Papers
SC3-Eval: Evaluating Robot Foundation Models via Self-Consistent Video Generation
arXiv:2606.18610v2 Announce Type: replace-cross Abstract: Evaluating generalist robot manipulation policies in the real world is expensive, slow, and difficult to scale. Action-conditioned video world models offer a scalable alternative by simulating policy rollouts. Autoregressive rollouts accumulate compounding error