Gradient Descent with Identity Initialization Efficiently Learns Positive-Definite Linear Transformations by Deep Residual Networks

Peter L. Bartlett, David P. Helmbold, Philip M. Long. Gradient Descent with Identity Initialization Efficiently Learns Positive-Definite Linear Transformations by Deep Residual Networks. Neural Computation, 31(3), 2019. [doi]

Authors

Peter L. Bartlett

This author has not been identified. Look up 'Peter L. Bartlett' in Google

David P. Helmbold

This author has not been identified. Look up 'David P. Helmbold' in Google

Philip M. Long

This author has not been identified. Look up 'Philip M. Long' in Google