Skip to content
arXiv cs.CV · Papers

IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models

arXiv:2604.00757v2 Announce Type: replace Abstract: Large Vision Language Models show impressive performance across image and video understanding tasks, yet their computational cost grows rapidly with the number of visual tokens. Existing token pruning methods mitigate this issue through empirical approaches while over