Human-Feedback Shield Synthesis for Perceived Safety in Deep Reinforcement Learning

Daniel Marta, Christian Pek, Gaspar Isaac Melsión, Jana Tumova, Iolanda Leite. Human-Feedback Shield Synthesis for Perceived Safety in Deep Reinforcement Learning. IEEE Robotics and Automation Letters, 7(1):406-413, 2022. [doi]

@article{MartaPMTL22,
  title = {Human-Feedback Shield Synthesis for Perceived Safety in Deep Reinforcement Learning},
  author = {Daniel Marta and Christian Pek and Gaspar Isaac Melsión and Jana Tumova and Iolanda Leite},
  year = {2022},
  doi = {10.1109/LRA.2021.3128237},
  url = {https://doi.org/10.1109/LRA.2021.3128237},
  researchr = {https://researchr.org/publication/MartaPMTL22},
  cites = {0},
  citedby = {0},
  journal = {IEEE Robotics and Automation Letters},
  volume = {7},
  number = {1},
  pages = {406-413},
}