{"ID":2854490,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.14398","arxiv_id":"2510.14398","title":"YNTP-100: A Benchmark for Your Next Token Prediction with 100 People","abstract":"Large language models (LLMs) trained for general \\textit{next-token prediction} often fail to generate responses that reflect how specific individuals communicate. Progress on personalized alignment is further limited by the difficulty of collecting real-world personal communication data due to privacy constraints. We propose Your Next Token Prediction (YNTP), a task that formulates personalized response generation as token-level prediction conditioned on user interaction history. We introduce \\textbf{YNTP-100}, a benchmark built from multilingual multi-day human--agent conversations with 100 people, enabling systematic evaluation of user-specific response behavior. We evaluate external (parameter-preserving) and internal (parameter-updating) alignment methods using metrics of substance similarity and stylistic consistency. The dataset and results are publicly available at: https://github.com/AnonymousHub4Submissions/YNTP100.","short_abstract":"Large language models (LLMs) trained for general \\textit{next-token prediction} often fail to generate responses that reflect how specific individuals communicate. Progress on personalized alignment is further limited by the difficulty of collecting real-world personal communication data due to privacy constraints. We...","url_abs":"https://arxiv.org/abs/2510.14398","url_pdf":"https://arxiv.org/pdf/2510.14398v3","authors":"[\"Shiyao Ding\",\"Takayuki Ito\"]","published":"2025-10-16T07:54:02Z","proceeding":"cs.CL","tasks":"[\"cs.CL\"]","methods":"[\"Large Language Model\",\"Language Model\"]","has_code":false,"code_links":[{"ID":608159,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2854490,"paper_url":"https://arxiv.org/abs/2510.14398","paper_title":"YNTP-100: A Benchmark for Your Next Token Prediction with 100 People","repo_url":"https://github.com/AnonymousHub4Submissions/YNTP100","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
