Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture

Christopher Brix, Parnia Bahar, Hermann Ney. Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 3909-3915, Association for Computational Linguistics, 2020.
