Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System

Chang Tian, Wenpeng Yin, Marie-Francine Moens. Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System. In Marine Carpuat, Marie-Catherine de Marneffe, Iván Vladimir Meza Ruíz, editors, Findings of the Association for Computational Linguistics: NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pages 565-577. Association for Computational Linguistics, 2022.
