{"ID":2889510,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.20560","arxiv_id":"2507.20560","title":"Statistical Inference for Differentially Private Stochastic Gradient Descent","abstract":"Privacy preservation in machine learning, particularly through Differentially Private Stochastic Gradient Descent (DP-SGD), is critical for sensitive data analysis. However, existing statistical inference methods for SGD predominantly focus on cyclic subsampling, while DP-SGD requires randomized subsampling. This paper first bridges this gap by establishing the asymptotic properties of SGD under the randomized rule and extending these results to DP-SGD. For the output of DP-SGD, we show that the asymptotic variance decomposes into statistical, sampling, and privacy-induced components. Two methods are proposed for constructing valid confidence intervals: the plug-in method and the random scaling method. We also perform extensive numerical analysis, which shows that the proposed confidence intervals achieve nominal coverage rates while maintaining privacy.","short_abstract":"Privacy preservation in machine learning, particularly through Differentially Private Stochastic Gradient Descent (DP-SGD), is critical for sensitive data analysis. However, existing statistical inference methods for SGD predominantly focus on cyclic subsampling, while DP-SGD requires randomized subsampling. This paper...","url_abs":"https://arxiv.org/abs/2507.20560","url_pdf":"https://arxiv.org/pdf/2507.20560v2","authors":"[\"Xintao Xia\",\"Linjun Zhang\",\"Zhanrui Cai\"]","published":"2025-07-28T06:45:15Z","proceeding":"stat.ML","tasks":"[\"stat.ML\",\"cs.LG\",\"stat.ME\"]","methods":"[]","has_code":false}
