A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action

Takashi Watanabe, Takashi Sakuragawa. A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action. In ICMLSC 2020: The 4th International Conference on Machine Learning and Soft Computing, Haiphong City, Viet Nam, January 17-19, 2020. pages 51-55, ACM, 2020. [doi]

Abstract

Abstract is missing.