Counterfactual Off-Policy Training for Neural Dialogue Generation

Qingfu Zhu, Wei-Nan Zhang 0003, Ting Liu, William Yang Wang. Counterfactual Off-Policy Training for Neural Dialogue Generation. In Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020. pages 3438-3448, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.