{"ID":2834192,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2512.03234","arxiv_id":"2512.03234","title":"Iterative Tilting for Diffusion Fine-Tuning","abstract":"We introduce iterative tilting, a gradient-free method for fine-tuning diffusion models toward reward-tilted distributions. The method decomposes a large reward tilt $\\exp(λr)$ into $N$ sequential smaller tilts, each admitting a tractable score update via first-order Taylor expansion. This requires only forward evaluations of the reward function and avoids backpropagating through sampling chains. We validate on a two-dimensional Gaussian mixture with linear reward, where the exact tilted distribution is available in closed form.","short_abstract":"We introduce iterative tilting, a gradient-free method for fine-tuning diffusion models toward reward-tilted distributions. The method decomposes a large reward tilt $\\exp(λr)$ into $N$ sequential smaller tilts, each admitting a tractable score update via first-order Taylor expansion. This requires only forward evaluat...","url_abs":"https://arxiv.org/abs/2512.03234","url_pdf":"https://arxiv.org/pdf/2512.03234v1","authors":"[\"Jean Pachebat\",\"Giovanni Conforti\",\"Alain Durmus\",\"Yazid Janati\"]","published":"2025-12-02T21:07:46Z","proceeding":"stat.ML","tasks":"[\"stat.ML\",\"cs.LG\"]","methods":"[\"Diffusion Model\"]","has_code":false}
