Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System

Chang Tian, Wenpeng Yin 0002, Marie-Francine Moens. Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System. In Marine Carpuat, Marie-Catherine de Marneffe, Iván Vladimir Meza Ruíz, editors, Findings of the Association for Computational Linguistics: NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pages 565-577. Association for Computational Linguistics, 2022.

Authors

Chang Tian

Wenpeng Yin 0002

Marie-Francine Moens
