arXiv cs.AI · Papers

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations

arXiv:2606.28556v1 Announce Type: new

Abstract: Recent advances in large language models and vision-language models have enabled reasoning over multimodal data, offering opportunities for clinical applications such as decision support and triaging. However, existing medical AI benchmarks are fragmented: some support mu