{"ID":2845670,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.03220","arxiv_id":"2511.03220","title":"Multimodal-Wireless: A Large-Scale Dataset for Sensing and Communication","abstract":"This paper presents Multimodal-Wireless, a large-scale open-source dataset for multimodal sensing and communication research. The dataset is generated through an integrated and customizable data pipeline built upon the CARLA simulator and Sionna framework, and features high-resolution communication channel state information (CSI) fully synchronized with five other sensor modalities, namely LiDAR, RGB and depth camera, inertial measurement unit (IMU) and radar, all sampled at 100 Hz. It contains approximately 160,000 frames collected across four virtual towns, sixteen communication scenarios, and three weather conditions. This paper provides a comprehensive overview of the dataset, outlining its key features, overall framework, and technical implementation details. In addition, it explores potential research applications concerning communication and collaborative perception, exemplified by beam prediction using a multimodal large language model. The dataset is open in https://le-liang.github.io/mmw/.","short_abstract":"This paper presents Multimodal-Wireless, a large-scale open-source dataset for multimodal sensing and communication research. The dataset is generated through an integrated and customizable data pipeline built upon the CARLA simulator and Sionna framework, and features high-resolution communication channel state inform...","url_abs":"https://arxiv.org/abs/2511.03220","url_pdf":"https://arxiv.org/pdf/2511.03220v2","authors":"[\"Tianhao Mao\",\"Le Liang\",\"Jie Yang\",\"Hao Ye\",\"Shi Jin\",\"Geoffrey Ye Li\"]","published":"2025-11-05T06:15:00Z","proceeding":"eess.SP","tasks":"[\"eess.SP\"]","methods":"[\"Language Model\"]","has_code":false}
