BRPO: Batch Residual Policy Optimization

Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier. BRPO: Batch Residual Policy Optimization. In Christian Bessiere, editor, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020 [scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic]. pages 2824-2830, ijcai.org, 2020. [doi]

Authors

Sungryull Sohn

This author has not been identified. Look up 'Sungryull Sohn' in Google

Yinlam Chow

This author has not been identified. Look up 'Yinlam Chow' in Google

Jayden Ooi

This author has not been identified. Look up 'Jayden Ooi' in Google

Ofir Nachum

This author has not been identified. Look up 'Ofir Nachum' in Google

Honglak Lee

This author has not been identified. Look up 'Honglak Lee' in Google

Ed Chi

This author has not been identified. Look up 'Ed Chi' in Google

Craig Boutilier

This author has not been identified. Look up 'Craig Boutilier' in Google