Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data

Aleksandra Malysheva, Daniel Kudenko, Aleksei Shpilman. Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data. In 15th International Conference on Control, Automation, Robotics and Vision, ICARCV 2018, Singapore, November 18-21, 2018. pages 286-291, IEEE, 2018. [doi]

Bibliographies