{"ID":2852012,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.21805","arxiv_id":"2510.21805","title":"DiffGRM: Diffusion-based Generative Recommendation Model","abstract":"Generative recommendation (GR) is an emerging paradigm that represents each item via a tokenizer as an n-digit semantic ID (SID) and predicts the next item by autoregressively generating its SID conditioned on the user's history. However, two structural properties of SIDs make ARMs ill-suited. First, intra-item consistency: the n digits jointly specify one item, yet the left-to-right causality trains each digit only under its prefix and blocks bidirectional cross-digit evidence, collapsing supervision to a single causal path. Second, inter-digit heterogeneity: digits differ in semantic granularity and predictability, while the uniform next-token objective assigns equal weight to all digits, overtraining easy digits and undertraining hard digits. To address these two issues, we propose DiffGRM, a diffusion-based GR model that replaces the autoregressive decoder with a masked discrete diffusion model (MDM), thereby enabling bidirectional context and any-order parallel generation of SID digits for recommendation. Specifically, we tailor DiffGRM in three aspects: (1) tokenization with Parallel Semantic Encoding (PSE) to decouple digits and balance per-digit information; (2) training with On-policy Coherent Noising (OCN) that prioritizes uncertain digits via coherent masking to concentrate supervision on high-value signals; and (3) inference with Confidence-guided Parallel Denoising (CPD) that fills higher-confidence digits first and generates diverse Top-K candidates. Experiments show consistent gains over strong generative and discriminative recommendation baselines on multiple datasets, improving NDCG@10 by 6.9%-15.5%. Code is available at https://github.com/liuzhao09/DiffGRM.","short_abstract":"Generative recommendation (GR) is an emerging paradigm that represents each item via a tokenizer as an n-digit semantic ID (SID) and predicts the next item by autoregressively generating its SID conditioned on the user's history. However, two structural properties of SIDs make ARMs ill-suited. First, intra-item consist...","url_abs":"https://arxiv.org/abs/2510.21805","url_pdf":"https://arxiv.org/pdf/2510.21805v1","authors":"[\"Zhao Liu\",\"Yichen Zhu\",\"Yiqing Yang\",\"Guoping Tang\",\"Rui Huang\",\"Qiang Luo\",\"Xiao Lv\",\"Ruiming Tang\",\"Kun Gai\",\"Guorui Zhou\"]","published":"2025-10-21T03:23:32Z","proceeding":"cs.IR","tasks":"[\"cs.IR\",\"cs.AI\",\"cs.LG\"]","methods":"[\"Diffusion Model\"]","has_code":false,"code_links":[{"ID":607953,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2852012,"paper_url":"https://arxiv.org/abs/2510.21805","paper_title":"DiffGRM: Diffusion-based Generative Recommendation Model","repo_url":"https://github.com/liuzhao09/DiffGRM","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
