Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Arghyadip Roy, Vivek S. Borkar, Abhay Karandikar, Prasanna Chaporkar. Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes. IEEE Trans. Automat. Contr., 67(7):3722-3729, 2022. [doi]

This author has not been identified. Look up 'Arghyadip Roy' in GoogleThis author has not been identified. Look up 'Vivek S. Borkar' in GoogleThis author has not been identified. Look up 'Abhay Karandikar' in GoogleThis author has not been identified. Look up 'Prasanna Chaporkar' in Google

runs on WebDSL