Sihan Wang, Kaijie Zhou, Kunfeng Lai, Jianping Shen. Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network. In Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020. pages 3461-3471, Association for Computational Linguistics, 2020. [doi]
Abstract is missing.