A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action - researchr publication

researchr

You are not signed in
Sign in
Sign up

Takashi Watanabe, Takashi Sakuragawa. A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action. In ICMLSC 2020: The 4th International Conference on Machine Learning and Soft Computing, Haiphong City, Viet Nam, January 17-19, 2020. pages 51-55, ACM, 2020. [doi]

Abstract is missing.

runs on WebDSL