Potential-based reward shaping for finite horizon online POMDP planning

Adam Eck, Leen-Kiat Soh, Sam Devlin, Daniel Kudenko. Potential-based reward shaping for finite horizon online POMDP planning. Autonomous Agents and Multi-Agent Systems, 30(3):403-445, 2016. [doi]

Abstract

Abstract is missing.