Anderson-type acceleration method for Deep Neural Network optimization

math.NA arXiv:2510.20254
View PDF arXiv JSON

Abstract

In this paper we consider the neural network optimization. We develop Anderson-type acceleration method for the stochastic gradient decent method and it improves the network permanence very much. We demonstrate the applicability of the method for Deep Neural Network (DNN) and Convolution Neural Network (CNN).

PDF Viewer