Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

Jianfeng Gao, Kam-Fai Wong, Baolin Peng, Jingjing Liu, Xiujun Li. Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. In Iryna Gurevych, Yusuke Miyao, editors, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers. pages 2182-2192, Association for Computational Linguistics, 2018. [doi]

Abstract

Abstract is missing.