Human-Feedback Shield Synthesis for Perceived Safety in Deep Reinforcement Learning

Daniel Marta, Christian Pek, Gaspar Isaac MelsiĆ³n, Jana Tumova, Iolanda Leite. Human-Feedback Shield Synthesis for Perceived Safety in Deep Reinforcement Learning. IEEE Robotics and Automation Letters, 7(1):406-413, 2022. [doi]

Abstract

Abstract is missing.