Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning

Guanlin Wu, Wenqi Fang, Ji Wang, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu, Zheng Wang. Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning. In Chengqing Zong, Fei Xia, Wenjie Li 0002, Roberto Navigli, editors, Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, Online Event, August 1-6, 2021. pages 1786-1795, Association for Computational Linguistics, 2021. [doi]

Abstract

Abstract is missing.