{"ID":2852380,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.18934","arxiv_id":"2510.18934","title":"Position: Many generalization measures for deep learning are fragile","abstract":"In this position paper, we argue that many post-mortem generalization measures -- those computed on trained networks -- are \\textbf{fragile}: small training modifications that barely affect the performance of the underlying deep neural network can substantially change a measure's value, trend, or scaling behavior. For example, minor hyperparameter changes, such as learning rate adjustments or switching between SGD variants, can reverse the slope of a learning curve in widely used generalization measures such as the path norm. We also identify subtler forms of fragility. For instance, the PAC-Bayes origin measure is regarded as one of the most reliable, and is indeed less sensitive to hyperparameter tweaks than many other measures. However, it completely fails to capture differences in data complexity across learning curves. This data fragility contrasts with the function-based marginal-likelihood PAC-Bayes bound, which does capture differences in data-complexity, including scaling behavior, in learning curves, but which is not a post-mortem measure. Beyond demonstrating that many post-mortem bounds are fragile, this position paper also argues that developers of new measures should explicitly audit them for fragility.","short_abstract":"In this position paper, we argue that many post-mortem generalization measures -- those computed on trained networks -- are \\textbf{fragile}: small training modifications that barely affect the performance of the underlying deep neural network can substantially change a measure's value, trend, or scaling behavior. For...","url_abs":"https://arxiv.org/abs/2510.18934","url_pdf":"https://arxiv.org/pdf/2510.18934v3","authors":"[\"Shuofeng Zhang\",\"Ard Louis\"]","published":"2025-10-21T17:44:41Z","proceeding":"cs.LG","tasks":"[\"cs.LG\"]","methods":"[]","has_code":false}
