Multi-Action Dialog Policy Learning from Logged User Feedback

Shuo Zhang, Junzhou Zhao, Pinghui Wang, Tianxiang Wang, Zi Liang, Jing Tao, Yi Huang, Junlan Feng. Multi-Action Dialog Policy Learning from Logged User Feedback. In Brian Williams 0001, Yiling Chen 0001, Jennifer Neville, editors, Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023. pages 13976-13983, AAAI Press, 2023. [doi]

Abstract

Abstract is missing.