Model-free safe policy learning via hard action barrier functions

Agustin Castellano, Juan-Andrés Bazerque, Enrique Mallada. Model-free safe policy learning via hard action barrier functions. In 55th Annual Conference on Information Sciences and Systems, CISS 2021, Baltimore, MD, USA, March 24-26, 2021. pages 1, IEEE, 2021.
