Yi Tay, Mostafa Dehghani 0001, Jai Prakash Gupta, Vamsi Aribandi, Dara Bahri, Zhen Qin 0002, Donald Metzler. Are Pretrained Convolutions Better than Pretrained Transformers?. In Chengqing Zong, Fei Xia, Wenjie Li 0002, Roberto Navigli, editors, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021. pages 4349-4359, Association for Computational Linguistics, 2021. [doi]
Abstract is missing.