Do Transformer Modifications Transfer Across Implementations and Applications?

Sharan Narang, Hyung Won Chung, Yi Tay, Liam Fedus, Thibault FĂ©vry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li 0133, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel. Do Transformer Modifications Transfer Across Implementations and Applications?. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih, editors, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021. pages 5758-5773, Association for Computational Linguistics, 2021. [doi]

Abstract

Abstract is missing.