WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation for Multi-turn Dialogue

Anant Khandelwal. WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation for Multi-turn Dialogue. In Song Feng, Siva Reddy, Malihe Alikhani, He He, Yangfeng Ji, Mohit Iyyer, Zhou Yu, editors, Proceedings of the 1st Workshop on Document-grounded Dialogue and Conversational Question Answering, DialDoc@ACL-IJCNLP 2021, Online, August 5, 2021. Pages 69-80, Association for Computational Linguistics, 2021.
