On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability

Vincent François-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau. On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability. J. Artif. Intell. Res. (JAIR), 65:1-30, 2019. [doi]

Abstract

Abstract is missing.