{"ID":2840375,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.12998","arxiv_id":"2511.12998","title":"PerTouch: VLM-Driven Agent for Personalized and Semantic Image Retouching","abstract":"Image retouching aims to enhance visual quality while aligning with users' personalized aesthetic preferences. To address the challenge of balancing controllability and subjectivity, we propose a unified diffusion-based image retouching framework called PerTouch. Our method supports semantic-level image retouching while maintaining global aesthetics. Using parameter maps containing attribute values in specific semantic regions as input, PerTouch constructs an explicit parameter-to-image mapping for fine-grained image retouching. To improve semantic boundary perception, we introduce semantic replacement and parameter perturbation mechanisms during training. To connect natural language instructions with visual control, we develop a VLM-driven agent to handle both strong and weak user instructions. Equipped with mechanisms of feedback-driven rethinking and scene-aware memory, PerTouch better aligns with user intent and captures long-term preferences. Extensive experiments demonstrate each component's effectiveness and the superior performance of PerTouch in personalized image retouching. Code Pages: https://github.com/Auroral703/PerTouch.","short_abstract":"Image retouching aims to enhance visual quality while aligning with users' personalized aesthetic preferences. To address the challenge of balancing controllability and subjectivity, we propose a unified diffusion-based image retouching framework called PerTouch. Our method supports semantic-level image retouching whil...","url_abs":"https://arxiv.org/abs/2511.12998","url_pdf":"https://arxiv.org/pdf/2511.12998v3","authors":"[\"Zewei Chang\",\"Zheng-Peng Duan\",\"Jianxing Zhang\",\"Chun-Le Guo\",\"Siyu Liu\",\"Hyungju Chun\",\"Hyunhee Park\",\"Zikun Liu\",\"Chongyi Li\"]","published":"2025-11-17T05:39:15Z","proceeding":"cs.CV","tasks":"[\"cs.CV\"]","methods":"[\"Diffusion Model\"]","has_code":false,"code_links":[{"ID":606962,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2840375,"paper_url":"https://arxiv.org/abs/2511.12998","paper_title":"PerTouch: VLM-Driven Agent for Personalized and Semantic Image Retouching","repo_url":"https://github.com/Auroral703/PerTouch","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
