Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

Arghyadip Roy, Vivek S. Borkar, Abhay Karandikar, Prasanna Chaporkar. Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes. IEEE Trans. Automat. Contr., 67(7):3722-3729, 2022.

