What's so special about BERT's layers? A closer look at the NLP pipeline in monolingual and multilingual models

Wietse de Vries, Andreas van Cranenburgh, Malvina Nissim. What's so special about BERT's layers? A closer look at the NLP pipeline in monolingual and multilingual models. In Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, EMNLP 2020, Online Event, 16-20 November 2020. pages 4339-4350, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.