Regret Analysis for RL using Renewal Bandit Feedback

Sujay Bhatt, Guanhua Fang, Ping Li, Gennady Samorodnitsky. Regret Analysis for RL using Renewal Bandit Feedback. In IEEE Information Theory Workshop, ITW 2022, Mumbai, India, November 1-9, 2022. Pages 137-142, IEEE, 2022.
