Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand. Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion. In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024. [doi]

Abstract

Abstract is missing.