Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro. Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023. pages 7763-7786, Association for Computational Linguistics, 2023. [doi]

Authors

Boxin Wang

This author has not been identified. Look up 'Boxin Wang' in Google

Wei Ping

This author has not been identified. Look up 'Wei Ping' in Google

Peng Xu

This author has not been identified. Look up 'Peng Xu' in Google

Lawrence McAfee

This author has not been identified. Look up 'Lawrence McAfee' in Google

Zihan Liu

This author has not been identified. Look up 'Zihan Liu' in Google

Mohammad Shoeybi

This author has not been identified. Look up 'Mohammad Shoeybi' in Google

Yi Dong

This author has not been identified. Look up 'Yi Dong' in Google

Oleksii Kuchaiev

This author has not been identified. Look up 'Oleksii Kuchaiev' in Google

Bo Li

This author has not been identified. Look up 'Bo Li' in Google

Chaowei Xiao

This author has not been identified. Look up 'Chaowei Xiao' in Google

Anima Anandkumar

This author has not been identified. Look up 'Anima Anandkumar' in Google

Bryan Catanzaro

This author has not been identified. Look up 'Bryan Catanzaro' in Google