{"ID":2843132,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.07821","arxiv_id":"2511.07821","title":"SynTTS-Commands: A Public Dataset for On-Device KWS via TTS-Synthesized Multilingual Speech","abstract":"The development of high-performance, on-device keyword spotting (KWS) systems for ultra-low-power hardware is critically constrained by the scarcity of specialized, multi-command training datasets. Traditional data collection through human recording is costly, slow, and lacks scalability. This paper introduces SYNTTS-COMMANDS, a novel, multilingual voice command dataset entirely generated using state-of-the-art Text-to-Speech (TTS) synthesis. By leveraging the CosyVoice 2 model and speaker embeddings from public corpora, we created a scalable collection of English and Chinese commands. Extensive benchmarking across a range of efficient acoustic models demonstrates that our synthetic dataset enables exceptional accuracy, achieving up to 99.5\\% on English and 98\\% on Chinese command recognition. These results robustly validate that synthetic speech can effectively replace human-recorded audio for training KWS classifiers. Our work directly addresses the data bottleneck in TinyML, providing a practical, scalable foundation for building private, low-latency, and energy-efficient voice interfaces on resource-constrained edge devices. The dataset and source code are publicly available at https://github.com/lugan113/SynTTS-Commands-Official.","short_abstract":"The development of high-performance, on-device keyword spotting (KWS) systems for ultra-low-power hardware is critically constrained by the scarcity of specialized, multi-command training datasets. Traditional data collection through human recording is costly, slow, and lacks scalability. This paper introduces SYNTTS-C...","url_abs":"https://arxiv.org/abs/2511.07821","url_pdf":"https://arxiv.org/pdf/2511.07821v2","authors":"[\"Lu Gan\",\"Xi Li\"]","published":"2025-11-11T04:38:06Z","proceeding":"cs.SD","tasks":"[\"cs.SD\"]","methods":"[\"Generative Adversarial Network\"]","has_code":false,"code_links":[{"ID":607186,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2843132,"paper_url":"https://arxiv.org/abs/2511.07821","paper_title":"SynTTS-Commands: A Public Dataset for On-Device KWS via TTS-Synthesized Multilingual Speech","repo_url":"https://github.com/lugan113/SynTTS-Commands-Official","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
