The following publications are possibly variants of this publication:
- Natural belief-critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systemsFilip Jurcícek, Blaise Thomson, Simon Keizer, François Mairesse, Milica Gasic, Kai Yu, Steve Young. interspeech 2010: 90-93 [doi]
- Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue ManagementPei-hao Su, Pawel Budzianowski, Stefan Ultes, Milica Gasic, Steve J. Young. sigdial 2017: 147-157 [doi]