Off-policy learning in large-scale POMDP-based dialogue systems

Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Off-policy learning in large-scale POMDP-based dialogue systems. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012. pages 4989-4992, IEEE, 2012. [doi]

Abstract

Abstract is missing.