Gradient Descent Optimizes Normalization-Free ResNets

Zongpeng Zhang, Zenan Ling, Tong Lin, Zhouchen Lin. Gradient Descent Optimizes Normalization-Free ResNets. In International Joint Conference on Neural Networks, IJCNN 2023, Gold Coast, Australia, June 18-23, 2023. Pages 1-8, IEEE, 2023.

Abstract

Abstract is missing.