Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning - researchr publication

researchr

You are not signed in
Sign in
Sign up

Amin Karbasi, Nikki Lijing Kuang, Yi-An Ma, Siddharth Mitra. Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 15828-15860, PMLR, 2023. [doi]

Abstract is missing.

runs on WebDSL