Anti-Concentrated Confidence Bonuses For Scalable Exploration

Jordan T. Ash, Cyril Zhang, Surbhi Goel, Akshay Krishnamurthy, Sham M. Kakade. Anti-Concentrated Confidence Bonuses For Scalable Exploration. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Authors

Jordan T. Ash

This author has not been identified. Look up 'Jordan T. Ash' in Google

Cyril Zhang

This author has not been identified. Look up 'Cyril Zhang' in Google

Surbhi Goel

This author has not been identified. Look up 'Surbhi Goel' in Google

Akshay Krishnamurthy

This author has not been identified. Look up 'Akshay Krishnamurthy' in Google

Sham M. Kakade

This author has not been identified. Look up 'Sham M. Kakade' in Google