How does Gradient Descent Learn Features - A Local Analysis for Regularized Two-Layer Neural Networks

Mo Zhou, Rong Ge 0001. How does Gradient Descent Learn Features - A Local Analysis for Regularized Two-Layer Neural Networks. In Amir Globerson, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024.

Authors

Mo Zhou

Rong Ge 0001