arXiv cs.CV
Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding
arXiv:2603.19054v2 Announce Type: replace Abstract: Recent advances in Streaming Video Understanding have enabled a new interaction paradigm where models respond proactively to user queries. Current proactive VideoLLMs rely on per-frame triggering decision-making, which suffers from an efficiency-accuracy dilemma. We pr