{"ID":2843479,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.08369","arxiv_id":"2511.08369","title":"Text-based Aerial-Ground Person Retrieval","abstract":"This work introduces Text-based Aerial-Ground Person Retrieval (TAG-PR), which aims to retrieve person images from heterogeneous aerial and ground views with textual descriptions. Unlike traditional Text-based Person Retrieval (T-PR), which focuses solely on ground-view images, TAG-PR introduces greater practical significance and presents unique challenges due to the large viewpoint discrepancy across images. To support this task, we contribute: (1) TAG-PEDES dataset, constructed from public benchmarks with automatically generated textual descriptions, enhanced by a diversified text generation paradigm to ensure robustness under view heterogeneity; and (2) TAG-CLIP, a novel retrieval framework that addresses view heterogeneity through a hierarchically-routed mixture of experts module to learn view-specific and view-agnostic features and a viewpoint decoupling strategy to decouple view-specific features for better cross-modal alignment. We evaluate the effectiveness of TAG-CLIP on both the proposed TAG-PEDES dataset and existing T-PR benchmarks. The dataset and code are available at https://github.com/Flame-Chasers/TAG-PR.","short_abstract":"This work introduces Text-based Aerial-Ground Person Retrieval (TAG-PR), which aims to retrieve person images from heterogeneous aerial and ground views with textual descriptions. Unlike traditional Text-based Person Retrieval (T-PR), which focuses solely on ground-view images, TAG-PR introduces greater practical signi...","url_abs":"https://arxiv.org/abs/2511.08369","url_pdf":"https://arxiv.org/pdf/2511.08369v1","authors":"[\"Xinyu Zhou\",\"Yu Wu\",\"Jiayao Ma\",\"Wenhao Wang\",\"Min Cao\",\"Mang Ye\"]","published":"2025-11-11T15:49:04Z","proceeding":"cs.CV","tasks":"[\"cs.CV\",\"cs.AI\"]","methods":"[\"Mixture of Experts\"]","has_code":false,"code_links":[{"ID":607218,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2843479,"paper_url":"https://arxiv.org/abs/2511.08369","paper_title":"Text-based Aerial-Ground Person Retrieval","repo_url":"https://github.com/Flame-Chasers/TAG-PR","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
