Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh. Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020.

Authors

Sreejith Balakrishnan

Quoc Phong Nguyen

Bryan Kian Hsiang Low

Harold Soh
