arXiv cs.CV June 24, 2026 · Papers

HANCLIP: A Family of Hyperbolic Angular Negation Vision Language Models

arXiv:2606.23843v1 Announce Type: new Abstract: Vision-Language Models (VLMs) are typically pre-trained on large-scale image-text datasets to capture semantic correspondences between visual content and natural language. However, they remain surprisingly brittle to negation: models often rely on shallow word co-occurren

Read original