{"ID":2837455,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.18934","arxiv_id":"2511.18934","title":"Skeletons Matter: Dynamic Data Augmentation for Text-to-Query","abstract":"The task of translating natural language questions into query languages has long been a central focus in semantic parsing. Recent advancements in Large Language Models (LLMs) have significantly accelerated progress in this field. However, existing studies typically focus on a single query language, resulting in methods with limited generalizability across different languages. In this paper, we formally define the Text-to-Query task paradigm, unifying semantic parsing tasks across various query languages. We identify query skeletons as a shared optimization target of Text-to-Query tasks, and propose a general dynamic data augmentation framework that explicitly diagnoses model-specific weaknesses in handling these skeletons to synthesize targeted training data. Experiments on four Text-to-Query benchmarks demonstrate that our method achieves state-of-the-art performance using only a small amount of synthesized data, highlighting the efficiency and generality of our approach and laying a solid foundation for unified research on Text-to-Query tasks. We release our code at https://github.com/jjjycaptain/Skeletron.","short_abstract":"The task of translating natural language questions into query languages has long been a central focus in semantic parsing. Recent advancements in Large Language Models (LLMs) have significantly accelerated progress in this field. However, existing studies typically focus on a single query language, resulting in methods...","url_abs":"https://arxiv.org/abs/2511.18934","url_pdf":"https://arxiv.org/pdf/2511.18934v1","authors":"[\"Yuchen Ji\",\"Bo Xu\",\"Jie Shi\",\"Jiaqing Liang\",\"Deqing Yang\",\"Yu Mao\",\"Hai Chen\",\"Yanghua Xiao\"]","published":"2025-11-24T09:39:03Z","proceeding":"cs.CL","tasks":"[\"cs.CL\",\"cs.AI\",\"cs.DB\"]","methods":"[\"Large Language Model\",\"Language Model\"]","has_code":false,"code_links":[{"ID":606693,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2837455,"paper_url":"https://arxiv.org/abs/2511.18934","paper_title":"Skeletons Matter: Dynamic Data Augmentation for Text-to-Query","repo_url":"https://github.com/jjjycaptain/Skeletron","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
