Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity

Jingzhao Zhang, Tianxing He, Suvrit Sra, Ali Jadbabaie. Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020.

Authors

Jingzhao Zhang

Tianxing He

Suvrit Sra

Ali Jadbabaie
