Non-convergence of stochastic gradient descent in the training of deep neural networks

Patrick Cheridito, Arnulf Jentzen, Florian Rossmannek. Non-convergence of stochastic gradient descent in the training of deep neural networks. J. Complexity, 64:101540, 2021. [doi]

Abstract

Abstract is missing.