The Reward Biased Method: An Optimism based Approach for Reinforcement Learning

Akshay Mete, Rahul Singh 0001, P. R. Kumar 0001. The Reward Biased Method: An Optimism based Approach for Reinforcement Learning. In 59th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2023, Monticello, IL, USA, September 26-29, 2023. pages 1-7, IEEE, 2023. [doi]

Authors

Akshay Mete

This author has not been identified. Look up 'Akshay Mete' in Google

Rahul Singh 0001

This author has not been identified. Look up 'Rahul Singh 0001' in Google

P. R. Kumar 0001

This author has not been identified. Look up 'P. R. Kumar 0001' in Google