The Reward Biased Method: An Optimism based Approach for Reinforcement Learning

Akshay Mete, Rahul Singh 0001, P. R. Kumar 0001. The Reward Biased Method: An Optimism based Approach for Reinforcement Learning. In 59th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2023, Monticello, IL, USA, September 26-29, 2023. pages 1-7, IEEE, 2023. [doi]

Abstract

Abstract is missing.