Scale Efficiently: Insights from Pretraining and Finetuning Transformers

Yi Tay, Mostafa Dehghani, Jinfeng Rao, William Fedus, Samira Abnar, Hyung Won Chung, Sharan Narang, Dani Yogatama, Ashish Vaswani, Donald Metzler. Scale Efficiently: Insights from Pretraining and Finetuning Transformers. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.
