Arghyadip Roy, Vivek S. Borkar, Abhay Karandikar, Prasanna Chaporkar. Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes. IEEE Trans. Automat. Contr., 67(7):3722-3729, 2022. [doi]
No references recorded for this publication.
No citations of this publication recorded.