Gradient Starvation: A Learning Proclivity in Neural Networks

Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron C. Courville, Doina Precup, Guillaume Lajoie. Gradient Starvation: A Learning Proclivity in Neural Networks. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 1256-1272, 2021.

Authors

Mohammad Pezeshki

Sékou-Oumar Kaba

Yoshua Bengio

Aaron C. Courville

Doina Precup

Guillaume Lajoie
