arXiv cs.AI
· Papers
IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations
arXiv:2606.28556v1 Announce Type: new Abstract: Recent advances in large language models and vision-language models have enabled reasoning over multimodal data, offering opportunities for clinical applications such as decision support and triaging. However, existing medical AI benchmarks are fragmented: some support mu