Inferring bounds on the performance of a control policy from a sample of trajectories

Raphaël Fonteneau, Susan A. Murphy, Louis Wehenkel, Damien Ernst. Inferring bounds on the performance of a control policy from a sample of trajectories. In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009, Nashville, TN, USA, March 31 - April 1, 2009. pages 117-123, IEEE, 2009.

Abstract

Abstract is missing.