Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

Arghyadip Roy, Vivek S. Borkar, Abhay Karandikar, Prasanna Chaporkar. Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes. IEEE Trans. Automat. Contr., 67(7):3722-3729, 2022.
