Online regret bounds for Markov decision processes with deterministic transitions

Ronald Ortner. Online regret bounds for Markov decision processes with deterministic transitions. Theoretical Computer Science, 411(29-30):2684-2695, 2010. [doi]

Abstract

Abstract is missing.