Policy invariance under reward transformations for multi-objective reinforcement learning

Patrick Mannion, Sam Devlin, Karl Mason, Jim Duggan, Enda Howley. Policy invariance under reward transformations for multi-objective reinforcement learning. Neurocomputing, 263:60-73, 2017. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.