Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Shipra Agrawal 0001, Randy Jia. Optimistic Posterior Sampling for Reinforcement Learning: Worst-Case Regret Bounds. Math. Oper. Res., 48(1):363-392, February 2023. [doi]
Possibly Related PublicationsThe following publications are possibly variants of this publication: Optimistic posterior sampling for reinforcement learning: worst-case regret boundsShipra Agrawal, Randy Jia. nips 2017: 1184-1194 [doi] Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case RegretHan Zhong 0001, Jiachen Hu, Yecheng Xue, Tongyang Li, Liwei Wang 0001. icml 2024: [doi]
The following publications are possibly variants of this publication: