Preference-learning based Inverse Reinforcement Learning for Dialog Control

Hiroaki Sugiyama, Toyomi Meguro, Yasuhiro Minami. Preference-learning based Inverse Reinforcement Learning for Dialog Control. In INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. pages 222-225, ISCA, 2012.

@inproceedings{SugiyamaMM12,
  title = {Preference-learning based Inverse Reinforcement Learning for Dialog Control},
  author = {Hiroaki Sugiyama and Toyomi Meguro and Yasuhiro Minami},
  year = {2012},
  url = {http://interspeech2012.org/accepted-abstract.html?id=916},
  researchr = {https://researchr.org/publication/SugiyamaMM12},
  cites = {0},
  citedby = {0},
  pages = {222--225},
  booktitle = {INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012},
  publisher = {ISCA},
}