{"ID":2863057,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.21733","arxiv_id":"2510.21733","title":"Augmenting Researchy Questions with Sub-question Judgments","abstract":"The Researchy Questions dataset provides about 100k question queries with complex information needs that require retrieving information about several aspects of a topic. Each query in ResearchyQuestions is associated with sub-questions that were produced by prompting GPT-4. While ResearchyQuestions contains labels indicating what documents were clicked after issuing the query, there are no associations in the dataset between sub-questions and relevant documents. In this work, we augment the Researchy Questions dataset with LLM-judged labels for each sub-question using a Llama3.3 70B model. We intend these sub-question labels to serve as a resource for training retrieval models that better support complex information needs.","short_abstract":"The Researchy Questions dataset provides about 100k question queries with complex information needs that require retrieving information about several aspects of a topic. Each query in ResearchyQuestions is associated with sub-questions that were produced by prompting GPT-4. While ResearchyQuestions contains labels indi...","url_abs":"https://arxiv.org/abs/2510.21733","url_pdf":"https://arxiv.org/pdf/2510.21733v1","authors":"[\"Jia-Huei Ju\",\"Eugene Yang\",\"Trevor Adriaanse\",\"Andrew Yates\"]","published":"2025-09-30T19:27:34Z","proceeding":"cs.IR","tasks":"[\"cs.IR\"]","methods":"[\"Large Language Model\"]","has_code":false}