Close-to-optimal policies for Markovian bandits. (Politiques quasi-optimales de bandits Markoviens)

Chen Yan. Close-to-optimal policies for Markovian bandits. (Politiques quasi-optimales de bandits Markoviens). PhD thesis, Grenoble Alpes University, France, 2022.

Abstract

Abstract is missing.