PAC-optimal, Non-parametric Algorithms and Bounds for Exploration in Concurrent MDPs with Delayed Updates

Jason Pazis. PAC-optimal, Non-parametric Algorithms and Bounds for Exploration in Concurrent MDPs with Delayed Updates. PhD thesis, Duke University, Durham, NC, USA, 2015.
