PAC-Bayesian lifelong learning for multi-armed bandits

Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters 0001. PAC-Bayesian lifelong learning for multi-armed bandits. Data Min. Knowl. Discov., 36(2):841-876, 2022. [doi]

Abstract

Abstract is missing.