Xiaoyu Chen, Jiachen Hu, Lin Yang 0011, Liwei Wang. Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]
Abstract is missing.