{"ID":2859439,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.06079","arxiv_id":"2510.06079","title":"A Simple Adaptive Proximal Gradient Method for Nonconvex Optimization","abstract":"Consider composite nonconvex optimization problems where the objective function consists of a smooth nonconvex term (with Lipschitz-continuous gradient) and a convex (possibly nonsmooth) term. Existing parameter-free methods for such problems often rely on complex multi-loop structures, require line searches, or depend on restrictive assumptions (e.g., bounded iterates). To address these limitations, we introduce a novel adaptive proximal gradient method (referred to as AdaPGNC) that features a simple single-loop structure, eliminates the need for line searches, and only requires the gradient's Lipschitz continuity to ensure convergence. Furthermore, AdaPGNC achieves the theoretically optimal iteration/gradient evaluation complexity of $\\mathcal{O}(\\varepsilon^{-2})$ for finding an $\\varepsilon$-stationary point. Our core innovation lies in designing an adaptive step size strategy that leverages upper and lower curvature estimates. A key technical contribution is the development of a novel Lyapunov function that effectively balances the function value gap and the norm-squared of consecutive iterate differences, serving as a central component in our convergence analysis. Preliminary experimental results indicate that AdaPGNC demonstrates competitive performance on several benchmark nonconvex (and convex) problems against state-of-the-art parameter-free methods.","short_abstract":"Consider composite nonconvex optimization problems where the objective function consists of a smooth nonconvex term (with Lipschitz-continuous gradient) and a convex (possibly nonsmooth) term. Existing parameter-free methods for such problems often rely on complex multi-loop structures, require line searches, or depend...","url_abs":"https://arxiv.org/abs/2510.06079","url_pdf":"https://arxiv.org/pdf/2510.06079v1","authors":"[\"Zilong Ye\",\"Shiqian Ma\",\"Junfeng Yang\",\"Danqing Zhou\"]","published":"2025-10-07T16:05:22Z","proceeding":"math.OC","tasks":"[\"math.OC\"]","methods":"[]","has_code":false}
