Model-free safe policy learning via hard action barrier functions

Agustin Castellano, Juan-Andrés Bazerque, Enrique Mallada. Model-free safe policy learning via hard action barrier functions. In 55th Annual Conference on Information Sciences and Systems, CISS 2021, Baltimore, MD, USA, March 24-26, 2021. pages 1, IEEE, 2021.

Authors

Agustin Castellano

Juan-Andrés Bazerque

Enrique Mallada
