Linear Convergence of Gradient Descent For Finite Width Over-parametrized Linear Networks With General Initialization

Ziqing Xu, Hancheng Min, Salma Tarmoun, Enrique Mallada, René Vidal. Linear Convergence of Gradient Descent For Finite Width Over-parametrized Linear Networks With General Initialization. In Francisco J. R. Ruiz, Jennifer G. Dy, Jan-Willem van de Meent, editors, International Conference on Artificial Intelligence and Statistics, 25-27 April 2023, Palau de Congressos, Valencia, Spain. Volume 206 of Proceedings of Machine Learning Research, pages 2262-2284, PMLR, 2023.

Abstract

Abstract is missing.