Guided Dialogue Policy Learning without Adversarial Learning in the Loop

Ziming Li 0001, Sungjin Lee, Baolin Peng, Jinchao Li, Julia Kiseleva, Maarten de Rijke, Shahin Shayandeh, Jianfeng Gao. Guided Dialogue Policy Learning without Adversarial Learning in the Loop. In Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, EMNLP 2020, Online Event, 16-20 November 2020. pages 2308-2317, Association for Computational Linguistics, 2020. [doi]

Authors

Ziming Li 0001

This author has not been identified. Look up 'Ziming Li 0001' in Google

Sungjin Lee

This author has not been identified. Look up 'Sungjin Lee' in Google

Baolin Peng

This author has not been identified. Look up 'Baolin Peng' in Google

Jinchao Li

This author has not been identified. Look up 'Jinchao Li' in Google

Julia Kiseleva

This author has not been identified. Look up 'Julia Kiseleva' in Google

Maarten de Rijke

This author has not been identified. It may be one of the following persons: Look up 'Maarten de Rijke' in Google

Shahin Shayandeh

This author has not been identified. Look up 'Shahin Shayandeh' in Google

Jianfeng Gao

This author has not been identified. Look up 'Jianfeng Gao' in Google