Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

Amin Karbasi, Nikki Lijing Kuang, Yi-An Ma, Siddharth Mitra. Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 15828-15860, PMLR, 2023. [doi]

@inproceedings{KarbasiKMM23,
  title = {Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning},
  author = {Amin Karbasi and Nikki Lijing Kuang and Yi-An Ma and Siddharth Mitra},
  year = {2023},
  url = {https://proceedings.mlr.press/v202/karbasi23a.html},
  researchr = {https://researchr.org/publication/KarbasiKMM23},
  cites = {0},
  citedby = {0},
  pages = {15828-15860},
  booktitle = {International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA},
  editor = {Andreas Krause 0001 and Emma Brunskill and KyungHyun Cho and Barbara Engelhardt and Sivan Sabato and Jonathan Scarlett},
  volume = {202},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}