Lessons on off-policy methods from a notification component of a chatbot

Scott Rome, Tianwen Chen, Michael Kreisel, Ding Zhou. Lessons on off-policy methods from a notification component of a chatbot. Machine Learning, 110(9):2577-2602, 2021.
