Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

Jianfeng Gao, Kam-Fai Wong, Baolin Peng, Jingjing Liu, Xiujun Li. Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. In Iryna Gurevych, Yusuke Miyao, editors, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers. pages 2182-2192, Association for Computational Linguistics, 2018.

Authors

Jianfeng Gao

Kam-Fai Wong

Baolin Peng

Jingjing Liu

Xiujun Li
