Which Neural Net Architectures Give Rise to Exploding and Vanishing Gradients?

Boris Hanin. Which Neural Net Architectures Give Rise to Exploding and Vanishing Gradients?. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, Roman Garnett, editors, Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada. pages 580-589, 2018. [doi]

Abstract

Abstract is missing.