Arnulf Jentzen, Adrian Riekert. A proof of convergence for the gradient descent optimization method with random initializations in the training of neural networks with ReLU activation for piecewise linear target functions. Journal of Machine Learning Research, 23, 2022. [doi]
Abstract is missing.