{"ID":2894337,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.11153","arxiv_id":"2507.11153","title":"Assessing Color Vision Test in Large Vision-language Models","abstract":"With the widespread adoption of large vision-language models, the capacity for color vision in these models is crucial. However, the color vision abilities of large vision-language models have not yet been thoroughly explored. To address this gap, we define a color vision testing task for large vision-language models and construct a dataset (a sample of the data is available at https://anonymous.4open.science/r/color-vision-test-dataset-3BCD) that covers multiple categories of test questions and tasks of varying difficulty levels. Furthermore, we analyze the types of errors made by large vision-language models and propose fine-tuning strategies to enhance their performance in color vision tests.","short_abstract":"With the widespread adoption of large vision-language models, the capacity for color vision in these models is crucial. However, the color vision abilities of large vision-language models have not yet been thoroughly explored. To address this gap, we define a color vision testing task for large vision-language models a...","url_abs":"https://arxiv.org/abs/2507.11153","url_pdf":"https://arxiv.org/pdf/2507.11153v1","authors":"[\"Hongfei Ye\",\"Bin Chen\",\"Wenxi Liu\",\"Yu Zhang\",\"Zhao Li\",\"Dandan Ni\",\"Hongyang Chen\"]","published":"2025-07-15T10:03:06Z","proceeding":"cs.CV","tasks":"[\"cs.CV\",\"cs.AI\"]","methods":"[\"Language Model\"]","project_urls":"[\"https://anonymous.4open.science/r/color-vision-test-dataset-3BCD\"]","has_code":false}
