Generalization bounds for score-based generative models: a synthetic proof
Abstract
We establish minimax convergence rates for score-based generative models (SGMs) under the $1$-Wasserstein distance. Assuming the target density $p^\star$ lies in a nonparametric $β$-smooth Hölder class with either compact support or subGaussian tails on $\mathbb{R}^d$, we prove that neural network-based score estimators trained via denoising score matching yield generative models achieving rate $n^{-(β+1)/(2β+d)}$ up to polylogarithmic factors. Our unified analysis handles arbitrary smoothness $β> 0$, supports both deterministic and stochastic samplers, and leverages shape constraints on $p^\star$ to induce regularity of the score. The resulting proofs are more concise, and grounded in generic stability of diffusions and standard approximation theory.