BRPO: Batch Residual Policy Optimization

Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier. BRPO: Batch Residual Policy Optimization. In Christian Bessiere, editor, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020 [scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic]. pages 2824-2830, ijcai.org, 2020. [doi]

Abstract

Abstract is missing.