Reward Biased Maximum Likelihood Estimation for Reinforcement Learning

Akshay Mete, Rahul Singh, Xi Liu, P. R. Kumar 0001. Reward Biased Maximum Likelihood Estimation for Reinforcement Learning. In Ali Jadbabaie, John Lygeros, George J. Pappas, Pablo A. Parrilo, Benjamin Recht, Claire J. Tomlin, Melanie N. Zeilinger, editors, Proceedings of the 3rd Annual Conference on Learning for Dynamics and Control, L4DC 2021, 7-8 June 2021, Virtual Event, Switzerland. Volume 144 of Proceedings of Machine Learning Research, pages 815-827, PMLR, 2021. [doi]

Authors

Akshay Mete

This author has not been identified. Look up 'Akshay Mete' in Google

Rahul Singh

This author has not been identified. Look up 'Rahul Singh' in Google

Xi Liu

This author has not been identified. Look up 'Xi Liu' in Google

P. R. Kumar 0001

This author has not been identified. Look up 'P. R. Kumar 0001' in Google