Thompson sampling for Markov games with piecewise stationary opponent policies

Anthony DiGiovanni, Ambuj Tewari. Thompson sampling for Markov games with piecewise stationary opponent policies. In Cassio P. de Campos, Marloes H. Maathuis, Erik Quaeghebeur, editors, Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, UAI 2021, Virtual Event, 27-30 July 2021. Volume 161 of Proceedings of Machine Learning Research, pages 738-748, AUAI Press, 2021. [doi]

Authors

Anthony DiGiovanni

This author has not been identified. Look up 'Anthony DiGiovanni' in Google

Ambuj Tewari

This author has not been identified. Look up 'Ambuj Tewari' in Google