{"ID":2830739,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2512.09682","arxiv_id":"2512.09682","title":"Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies","abstract":"This work studies the application of Multi-Agent Reinforcement Learning (MARL) to decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for MARL scaling studies. A robust baseline policy is proposed which restricts agent motion and applies Dijkstra's shortest path algorithm. Computational experiment results show that two off-the-shelf MARL algorithms perform competitively with the baseline for a small number of agents, but face scalability issues as the number of agents increases. Source code and animations are available online at https://github.com/mikapersson/Information-Relaying.","short_abstract":"This work studies the application of Multi-Agent Reinforcement Learning (MARL) to decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for MARL scaling studies. A robust baseline policy is propos...","url_abs":"https://arxiv.org/abs/2512.09682","url_pdf":"https://arxiv.org/pdf/2512.09682v2","authors":"[\"Mika Persson\",\"Jonas Lidman\",\"Jacob Ljungberg\",\"Samuel Sandelius\",\"Adam Andersson\"]","published":"2025-12-10T14:29:04Z","proceeding":"eess.SY","tasks":"[\"eess.SY\",\"cs.AI\",\"cs.GT\",\"cs.MA\"]","methods":"[\"Reinforcement Learning\"]","has_code":false,"code_links":[{"ID":606070,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2830739,"paper_url":"https://arxiv.org/abs/2512.09682","paper_title":"Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies","repo_url":"https://github.com/mikapersson/Information-Relaying","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}