Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

Arghyadip Roy, Vivek S. Borkar, Abhay Karandikar, Prasanna Chaporkar. Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes. IEEE Trans. Automat. Contr., 67(7):3722-3729, 2022.

Authors

Arghyadip Roy

Vivek S. Borkar

Abhay Karandikar

Prasanna Chaporkar
