{"ID":2854150,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.15710","arxiv_id":"2510.15710","title":"UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis","abstract":"Medical workflows routinely combine reading images with producing visual and textual outputs, making both image understanding and generation central to medical AI. Most existing systems, however, address these abilities in isolated models, losing the shared knowledge that a unified architecture could exploit. To bridge this gap, we present UniMedVL, the first unified medical model that seamlessly integrates multimodal understanding and generation capabilities within a single model without switching weights. We achieve this via a tailored progressive training pipeline where understanding and generation mutually reinforce each other. To effectively train UniMedVL, we curate UniMedVL-5M, the first large-scale medical dataset comprising over 5.6M instances across 8 medical imaging modalities, tailored for multimodal input-output tasks in unified medical understanding and generation. Experimental results demonstrate that UniMedVL achieves competitive performance on five medical understanding benchmarks. Crucially, UniMedVL natively supports diverse interleaved generation tasks, e.g., virtual staining, super-resolution, cross-modal synthesis, essential for complex medical workflows. Our code and dataset are publicly available.","short_abstract":"Medical workflows routinely combine reading images with producing visual and textual outputs, making both image understanding and generation central to medical AI. Most existing systems, however, address these abilities in isolated models, losing the shared knowledge that a unified architecture could exploit. To bridge...","url_abs":"https://arxiv.org/abs/2510.15710","url_pdf":"https://arxiv.org/pdf/2510.15710v3","authors":"[\"Junzhi Ning\",\"Wei Li\",\"Cheng Tang\",\"Jiashi Lin\",\"Chenglong Ma\",\"Chaoyang Zhang\",\"Jiyao Liu\",\"Ying Chen\",\"Shujian Gao\",\"Yuandong Pu\",\"Huihui Xu\",\"Chenhui Gou\",\"Ziyan Huang\",\"Yi Xin\",\"Qi Qin\",\"Diping Song\",\"Bin Fu\",\"Guang Yang\",\"Yuanfeng Ji\",\"Tianbin Li\",\"Yanzhou Su\",\"Jin Ye\",\"Shixiang Tang\",\"Zhongying Deng\",\"Lihao Liu\",\"Ming Hu\",\"Junjun He\"]","published":"2025-10-17T14:54:58Z","proceeding":"cs.CV","tasks":"[\"cs.CV\"]","methods":"[]","has_code":false}
