Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Zihan Zhang, Xiangyang Ji, Simon S. Du. Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies. In Po-Ling Loh, Maxim Raginsky, editors, Conference on Learning Theory, 2-5 July 2022, London, UK. Volume 178 of Proceedings of Machine Learning Research, pages 3858-3904, PMLR, 2022. [doi]

This author has not been identified. Look up 'Zihan Zhang' in GoogleThis author has not been identified. Look up 'Xiangyang Ji' in GoogleThis author has not been identified. Look up 'Simon S. Du' in Google

runs on WebDSL