On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Alec Koppel. On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu 0001, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 1716-1731, PMLR, 2022. [doi]

Authors

Amrit Singh Bedi

This author has not been identified. Look up 'Amrit Singh Bedi' in Google

Souradip Chakraborty

This author has not been identified. Look up 'Souradip Chakraborty' in Google

Anjaly Parayil

This author has not been identified. Look up 'Anjaly Parayil' in Google

Brian M. Sadler

This author has not been identified. Look up 'Brian M. Sadler' in Google

Pratap Tokekar

This author has not been identified. Look up 'Pratap Tokekar' in Google

Alec Koppel

This author has not been identified. Look up 'Alec Koppel' in Google