Skip to content
arXiv cs.CV · Papers

Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding

arXiv:2603.19054v2 Announce Type: replace Abstract: Recent advances in Streaming Video Understanding has enabled a new interaction paradigm where models respond proactively to user queries. Current proactive VideoLLMs rely on per-frame triggering decision making, which suffers from an efficiency-accuracy dilemma. We pr