Potential-based reward shaping for finite horizon online POMDP planning

Adam Eck, Leen-Kiat Soh, Sam Devlin, Daniel Kudenko. Potential-based reward shaping for finite horizon online POMDP planning. Autonomous Agents and Multi-Agent Systems, 30(3):403-445, 2016. [doi]

@article{EckSDK16,
  title = {Potential-based reward shaping for finite horizon online POMDP planning},
  author = {Adam Eck and Leen-Kiat Soh and Sam Devlin and Daniel Kudenko},
  year = {2016},
  doi = {10.1007/s10458-015-9292-6},
  url = {http://dx.doi.org/10.1007/s10458-015-9292-6},
  researchr = {https://researchr.org/publication/EckSDK16},
  cites = {0},
  citedby = {0},
  journal = {Autonomous Agents and Multi-Agent Systems},
  volume = {30},
  number = {3},
  pages = {403-445},
}