Preference-Based Reinforcement Learning Using Dyad Ranking

Dirk Schäfer, Eyke Hüllermeier. Preference-Based Reinforcement Learning Using Dyad Ranking. In Larisa N. Soldatova, Joaquin Vanschoren, George A. Papadopoulos, Michelangelo Ceci, editors, Discovery Science - 21st International Conference, DS 2018, Limassol, Cyprus, October 29-31, 2018, Proceedings. Volume 11198 of Lecture Notes in Computer Science, pages 161-175, Springer, 2018. [doi]

@inproceedings{SchaferH18-0,
  title = {Preference-Based Reinforcement Learning Using Dyad Ranking},
  author = {Dirk Schäfer and Eyke Hüllermeier},
  year = {2018},
  doi = {10.1007/978-3-030-01771-2_11},
  url = {https://doi.org/10.1007/978-3-030-01771-2_11},
  researchr = {https://researchr.org/publication/SchaferH18-0},
  cites = {0},
  citedby = {0},
  pages = {161-175},
  booktitle = {Discovery Science - 21st International Conference, DS 2018, Limassol, Cyprus, October 29-31, 2018, Proceedings},
  editor = {Larisa N. Soldatova and Joaquin Vanschoren and George A. Papadopoulos and Michelangelo Ceci},
  volume = {11198},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-030-01771-2},
}