Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences

Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan. Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences. In Toby Walsh, editor, IJCAI 2011, Proceedings of the 22nd International Joint Conference on Artificial Intelligence, Barcelona, Catalonia, Spain, July 16-22, 2011. pages 1878-1883, IJCAI/AAAI, 2011. [doi]

Abstract

Abstract is missing.