Policy invariance under reward transformations for multi-objective reinforcement learning

Patrick Mannion, Sam Devlin, Karl Mason, Jim Duggan, Enda Howley. Policy invariance under reward transformations for multi-objective reinforcement learning. Neurocomputing, 263:60-73, 2017. [doi]

Abstract

Abstract is missing.