Invariance in Policy Optimisation and Partial Identifiability in Reward Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Joar Max Viktor Skalse, Matthew Farrugia-Roberts, Stuart Russell 0001, Alessandro Abate, Adam Gleave. Invariance in Policy Optimisation and Partial Identifiability in Reward Learning. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 32033-32058, PMLR, 2023. [doi]

This author has not been identified. Look up 'Joar Max Viktor Skalse' in GoogleThis author has not been identified. Look up 'Matthew Farrugia-Roberts' in GoogleThis author has not been identified. Look up 'Stuart Russell 0001' in GoogleThis author has not been identified. Look up 'Alessandro Abate' in GoogleThis author has not been identified. Look up 'Adam Gleave' in Google

runs on WebDSL