Patrick Mannion, Sam Devlin, Karl Mason, Jim Duggan, Enda Howley. Policy invariance under reward transformations for multi-objective reinforcement learning. Neurocomputing, 263:60-73, 2017.