{"ID":2889299,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.21977","arxiv_id":"2507.21977","title":"Motion Matters: Motion-guided Modulation Network for Skeleton-based Micro-Action Recognition","abstract":"Micro-Actions (MAs) are an important form of non-verbal communication in social interactions, with potential applications in human emotional analysis. However, existing methods in Micro-Action Recognition often overlook the inherent subtle changes in MAs, which limits the accuracy of distinguishing MAs with subtle changes. To address this issue, we present a novel Motion-guided Modulation Network (MMN) that implicitly captures and modulates subtle motion cues to enhance spatial-temporal representation learning. Specifically, we introduce a Motion-guided Skeletal Modulation module (MSM) to inject motion cues at the skeletal level, acting as a control signal to guide spatial representation modeling. In parallel, we design a Motion-guided Temporal Modulation module (MTM) to incorporate motion information at the frame level, facilitating the modeling of holistic motion patterns in micro-actions. Finally, we propose a motion consistency learning strategy to aggregate the motion cues from multi-scale features for micro-action classification. Experimental results on the Micro-Action 52 and iMiGUE datasets demonstrate that MMN achieves state-of-the-art performance in skeleton-based micro-action recognition, underscoring the importance of explicitly modeling subtle motion cues. The code will be available at https://github.com/momiji-bit/MMN.","short_abstract":"Micro-Actions (MAs) are an important form of non-verbal communication in social interactions, with potential applications in human emotional analysis. However, existing methods in Micro-Action Recognition often overlook the inherent subtle changes in MAs, which limits the accuracy of distinguishing MAs with subtle chan...","url_abs":"https://arxiv.org/abs/2507.21977","url_pdf":"https://arxiv.org/pdf/2507.21977v4","authors":"[\"Jihao Gu\",\"Kun Li\",\"Fei Wang\",\"Yanyan Wei\",\"Zhiliang Wu\",\"Hehe Fan\",\"Meng Wang\"]","published":"2025-07-29T16:27:10Z","proceeding":"cs.CV","tasks":"[\"cs.CV\"]","methods":"[]","has_code":false,"code_links":[{"ID":611637,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2889299,"paper_url":"https://arxiv.org/abs/2507.21977","paper_title":"Motion Matters: Motion-guided Modulation Network for Skeleton-based Micro-Action Recognition","repo_url":"https://github.com/momiji-bit/MMN","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
