Safe Reinforcement Learning through Phasic Safety-Oriented Policy Optimization

Sumanta Dey, Pallab Dasgupta, Soumyajit Dey. Safe Reinforcement Learning through Phasic Safety-Oriented Policy Optimization. In Gabriel Pedroza, Xiaowei Huang 0001, Xin Cynthia Chen, Andreas Theodorou, José Hernández-Orallo, Mauricio Castillo-Effen, Richard Mallah, John A. McDermid, editors, Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Washington DC, USA, February 13-14, 2023. Volume 3381 of CEUR Workshop Proceedings, CEUR-WS.org, 2023. [doi]

Authors

Sumanta Dey

This author has not been identified. Look up 'Sumanta Dey' in Google

Pallab Dasgupta

This author has not been identified. Look up 'Pallab Dasgupta' in Google

Soumyajit Dey

This author has not been identified. Look up 'Soumyajit Dey' in Google