Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers - researchr publication

researchr

You are not signed in
Sign in
Sign up

Zhuohan Li, Eric Wallace, Sheng Shen, Kevin Lin, Kurt Keutzer, Dan Klein, Joey Gonzalez. Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event. Volume 119 of Proceedings of Machine Learning Research, pages 5958-5968, PMLR, 2020. [doi]

Abstract is missing.

runs on WebDSL