Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies

Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge. Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 11795-11807, PMLR, 2021. [doi]

Authors

Tsung-Yen Yang

This author has not been identified. Look up 'Tsung-Yen Yang' in Google

Justinian Rosca

This author has not been identified. Look up 'Justinian Rosca' in Google

Karthik Narasimhan

This author has not been identified. Look up 'Karthik Narasimhan' in Google

Peter J. Ramadge

This author has not been identified. Look up 'Peter J. Ramadge' in Google