XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, Quoc V. Le. XLNet: Generalized Autoregressive Pretraining for Language Understanding. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada, pages 5754-5764, 2019.

Authors

Zhilin Yang

Zihang Dai

Yiming Yang

Jaime G. Carbonell

Ruslan Salakhutdinov

Quoc V. Le
