Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds

Roi Ceren, Prashant Doshi, Bikramjit Banerjee. Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds. In Catholijn M. Jonker, Stacy Marsella, John Thangarajah, Karl Tuyls, editors, Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, Singapore, May 9-13, 2016. pages 530-538, ACM, 2016. [doi]

Authors

Roi Ceren

This author has not been identified. Look up 'Roi Ceren' in Google

Prashant Doshi

This author has not been identified. Look up 'Prashant Doshi' in Google

Bikramjit Banerjee

This author has not been identified. Look up 'Bikramjit Banerjee' in Google