The following publications are possibly variants of this publication:
- A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning. Yang Zhao, Hua Qin, Zhenyu Wang, Changxi Zhu, Shihan Wang. NAACL 2022: 711-723 [doi]
- Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning. Yangyang Zhao, Zhenyu Wang, Zhenhua Huang. AAAI 2021: 14540-14548 [doi]
- Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning. Yangyang Zhao, Kai Yin, Zhenyu Wang 0001, Mehdi Dastani, Shihan Wang 0001. TASLP, 32:1380-1391, 2024. [doi]