Parallel reward and punishment control in humans and robots: Safe reinforcement learning using the MaxPain algorithm

Stefan Elfwing, Ben Seymour. Parallel reward and punishment control in humans and robots: Safe reinforcement learning using the MaxPain algorithm. In 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2017, Lisbon, Portugal, September 18-21, 2017. pages 140-147, IEEE, 2017. [doi]

Abstract

Abstract is missing.