{"ID":2858738,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.06938","arxiv_id":"2510.06938","title":"Expressive and Scalable Quantum Fusion for Multimodal Learning","abstract":"The aim of this paper is to introduce a quantum fusion mechanism for multimodal learning and to establish its theoretical and empirical potential. The proposed method, called the Quantum Fusion Layer (QFL), replaces classical fusion schemes with a hybrid quantum-classical procedure that uses parameterized quantum circuits to learn entangled feature interactions without requiring exponential parameter growth. Supported by quantum signal processing principles, the quantum component efficiently represents high-order polynomial interactions across modalities with linear parameter scaling, and we provide a separation example between QFL and low-rank tensor-based methods that highlights potential quantum query advantages. In simulation, QFL consistently outperforms strong classical baselines on small but diverse multimodal tasks, with particularly marked improvements in high-modality regimes. These results suggest that QFL offers a fundamentally new and scalable approach to multimodal fusion that merits deeper exploration on larger systems.","short_abstract":"The aim of this paper is to introduce a quantum fusion mechanism for multimodal learning and to establish its theoretical and empirical potential. The proposed method, called the Quantum Fusion Layer (QFL), replaces classical fusion schemes with a hybrid quantum-classical procedure that uses parameterized quantum circu...","url_abs":"https://arxiv.org/abs/2510.06938","url_pdf":"https://arxiv.org/pdf/2510.06938v1","authors":"[\"Tuyen Nguyen\",\"Trong Nghia Hoang\",\"Phi Le Nguyen\",\"Hai L. Vu\",\"Truong Cong Thang\"]","published":"2025-10-08T12:19:44Z","proceeding":"quant-ph","tasks":"[\"quant-ph\",\"cs.AI\"]","methods":"[\"LoRA\"]","has_code":false}
