A Robust Exploration Strategy in Reinforcement Learning Based on Temporal Difference Error

Muhammad Shadi Hajar, Harsha K. Kalutarage, M. Omar Al-Kadri. A Robust Exploration Strategy in Reinforcement Learning Based on Temporal Difference Error. In Haris Aziz 0001, DĂ©bora CorrĂȘa, Tim French 0002, editors, AI 2022: Advances in Artificial Intelligence - 35th Australasian Joint Conference, AI 2022, Perth, WA, Australia, December 5-8, 2022, Proceedings. Volume 13728 of Lecture Notes in Computer Science, pages 789-799, Springer, 2022. [doi]

Abstract

Abstract is missing.