PAC-optimal, Non-parametric Algorithms and Bounds for Exploration in Concurrent MDPs with Delayed Updates

Jason Pazis. PAC-optimal, Non-parametric Algorithms and Bounds for Exploration in Concurrent MDPs with Delayed Updates. PhD thesis, Duke University, Durham, NC, USA, 2015. [doi]

No reviews for this publication, yet.