SGD vs GD: Rank Deficiency in Linear Networks

Aditya Vardhan Varre, Margarita Sagitova, Nicolas Flammarion. SGD vs GD: Rank Deficiency in Linear Networks. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

Authors

Aditya Vardhan Varre

This author has not been identified. Look up 'Aditya Vardhan Varre' in Google

Margarita Sagitova

This author has not been identified. Look up 'Margarita Sagitova' in Google

Nicolas Flammarion

This author has not been identified. Look up 'Nicolas Flammarion' in Google