Policy oscillation is overshooting

Paul Wagner. Policy oscillation is overshooting. Neural Networks, 52:43-61, 2014. [doi]

Abstract

Abstract is missing.