Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Joar Max Viktor Skalse, Matthew Farrugia-Roberts, Stuart Russell 0001, Alessandro Abate, Adam Gleave. Invariance in Policy Optimisation and Partial Identifiability in Reward Learning. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 32033-32058, PMLR, 2023. [doi]

Authors

Joar Max Viktor Skalse

This author has not been identified. Look up 'Joar Max Viktor Skalse' in Google

Matthew Farrugia-Roberts

This author has not been identified. Look up 'Matthew Farrugia-Roberts' in Google

Stuart Russell 0001

This author has not been identified. Look up 'Stuart Russell 0001' in Google

Alessandro Abate

This author has not been identified. Look up 'Alessandro Abate' in Google

Adam Gleave

This author has not been identified. Look up 'Adam Gleave' in Google