Regret Bounds for Reinforcement Learning with Policy Advice

Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill. Regret Bounds for Reinforcement Learning with Policy Advice. In Hendrik Blockeel, Kristian Kersting, Siegfried Nijssen, Filip Zelezný, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2013, Prague, Czech Republic, September 23-27, 2013, Proceedings, Part I. Volume 8188 of Lecture Notes in Computer Science, pages 97-112, Springer, 2013. [doi]

@inproceedings{AzarLB13,
  title = {Regret Bounds for Reinforcement Learning with Policy Advice},
  author = {Mohammad Gheshlaghi Azar and Alessandro Lazaric and Emma Brunskill},
  year = {2013},
  doi = {10.1007/978-3-642-40988-2_7},
  url = {http://dx.doi.org/10.1007/978-3-642-40988-2_7},
  researchr = {https://researchr.org/publication/AzarLB13},
  cites = {0},
  citedby = {0},
  pages = {97-112},
  booktitle = {Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2013, Prague, Czech Republic, September 23-27, 2013, Proceedings, Part I},
  editor = {Hendrik Blockeel and Kristian Kersting and Siegfried Nijssen and Filip Zelezný},
  volume = {8188},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-642-40987-5},
}