{"ID":2881543,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2508.12524","arxiv_id":"2508.12524","title":"Results of the NeurIPS 2023 Neural MMO Competition on Multi-task Reinforcement Learning","abstract":"We present the results of the NeurIPS 2023 Neural MMO Competition, which attracted over 200 participants and submissions. Participants trained goal-conditional policies that generalize to tasks, maps, and opponents never seen during training. The top solution achieved a score 4x higher than our baseline within 8 hours of training on a single 4090 GPU. We open-source everything relating to Neural MMO and the competition under the MIT license, including the policy weights and training code for our baseline and for the top submissions.","short_abstract":"We present the results of the NeurIPS 2023 Neural MMO Competition, which attracted over 200 participants and submissions. Participants trained goal-conditional policies that generalize to tasks, maps, and opponents never seen during training. The top solution achieved a score 4x higher than our baseline within 8 hours...","url_abs":"https://arxiv.org/abs/2508.12524","url_pdf":"https://arxiv.org/pdf/2508.12524v1","authors":"[\"Joseph Suárez\",\"Kyoung Whan Choe\",\"David Bloomin\",\"Jianming Gao\",\"Yunkun Li\",\"Yao Feng\",\"Saidinesh Pola\",\"Kun Zhang\",\"Yonghui Zhu\",\"Nikhil Pinnaparaju\",\"Hao Xiang Li\",\"Nishaanth Kanna\",\"Daniel Scott\",\"Ryan Sullivan\",\"Rose S. Shuman\",\"Lucas de Alcântara\",\"Herbie Bradley\",\"Kirsty You\",\"Bo Wu\",\"Yuhao Jiang\",\"Qimai Li\",\"Jiaxin Chen\",\"Louis Castricato\",\"Xiaolong Zhu\",\"Phillip Isola\"]","published":"2025-08-17T23:14:25Z","proceeding":"cs.LG","tasks":"[\"cs.LG\"]","methods":"[\"Reinforcement Learning\"]","has_code":false}