EPMC: Every Visit Preference Monte Carlo for Reinforcement Learning

Christian Wirth, Johannes Fürnkranz. EPMC: Every Visit Preference Monte Carlo for Reinforcement Learning. In Cheng Soon Ong, Tu Bao Ho, editors, Asian Conference on Machine Learning, ACML 2013, Canberra, ACT, Australia, November 13-15, 2013. Volume 29 of JMLR Proceedings, pages 483-497, JMLR.org, 2013. [doi]

@inproceedings{WirthF13-0,
  title = {EPMC: Every Visit Preference Monte Carlo for Reinforcement Learning},
  author = {Christian Wirth and Johannes Fürnkranz},
  year = {2013},
  url = {http://jmlr.org/proceedings/papers/v29/Wirth13.html},
  researchr = {https://researchr.org/publication/WirthF13-0},
  cites = {0},
  citedby = {0},
  pages = {483-497},
  booktitle = {Asian Conference on Machine Learning, ACML 2013, Canberra, ACT, Australia, November 13-15, 2013},
  editor = {Cheng Soon Ong and Tu Bao Ho},
  volume = {29},
  series = {JMLR Proceedings},
  publisher = {JMLR.org},
}