AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training

Thiago D. Simão, Nils Jansen 0001, Matthijs T. J. Spaan. AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training. In Frank Dignum, Alessio Lomuscio, Ulle Endriss, Ann Nowé, editors, AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, Virtual Event, United Kingdom, May 3-7, 2021. pages 1226-1235, ACM, 2021. [doi]

Authors

Thiago D. Simão

This author has not been identified. Look up 'Thiago D. Simão' in Google

Nils Jansen 0001

This author has not been identified. Look up 'Nils Jansen 0001' in Google

Matthijs T. J. Spaan

This author has not been identified. Look up 'Matthijs T. J. Spaan' in Google