Gradient descent optimizes over-parameterized deep ReLU networks

Difan Zou, Yuan Cao, Dongruo Zhou, Quanquan Gu. Gradient descent optimizes over-parameterized deep ReLU networks. Machine Learning, 109(3):467-492, 2020.
