Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards

Amaury Gouverneur, Borja Rodríguez Gálvez, Tobias J. Oechtering, Mikael Skoglund. Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards. In IEEE International Symposium on Information Theory, ISIT 2023, Taipei, Taiwan, June 25-30, 2023. pages 1306-1311, IEEE, 2023. [doi]

Authors

Amaury Gouverneur

This author has not been identified. Look up 'Amaury Gouverneur' in Google

Borja Rodríguez Gálvez

This author has not been identified. Look up 'Borja Rodríguez Gálvez' in Google

Tobias J. Oechtering

This author has not been identified. Look up 'Tobias J. Oechtering' in Google

Mikael Skoglund

This author has not been identified. Look up 'Mikael Skoglund' in Google