Sample-efficient batch reinforcement learning for dialogue management optimization

Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan, Hervé Frezza-Buet. Sample-efficient batch reinforcement learning for dialogue management optimization. TSLP, 7(3):7, 2011. [doi]

Abstract

Abstract is missing.