Logarithmic regret of exploration in average reward Markov decision processes

Victor Boone, Bruno Gaujal. Logarithmic regret of exploration in average reward Markov decision processes. In Nika Haghtalab, Ankur Moitra, editors, The Thirty Eighth Annual Conference on Learning Theory, 30 June - 4 July 2025, Lyon, France. Volume 291 of Proceedings of Machine Learning Research, pages 454-533, PMLR, 2025.

