XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, Quoc V. Le. XLNet: Generalized Autoregressive Pretraining for Language Understanding. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 5754-5764, 2019.

@inproceedings{YangDYCSL19,
  title = {XLNet: Generalized Autoregressive Pretraining for Language Understanding},
  author = {Zhilin Yang and Zihang Dai and Yiming Yang and Jaime G. Carbonell and Ruslan Salakhutdinov and Quoc V. Le},
  year = {2019},
  url = {http://papers.nips.cc/paper/8812-xlnet-generalized-autoregressive-pretraining-for-language-understanding},
  researchr = {https://researchr.org/publication/YangDYCSL19},
  cites = {0},
  citedby = {0},
  pages = {5754--5764},
  booktitle = {Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada},
  editor = {Hanna M. Wallach and Hugo Larochelle and Alina Beygelzimer and Florence d'Alché-Buc and Edward A. Fox and Roman Garnett},
}