Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend W., Junwei Lu, Leo Anthony Celi, Tianxi Cai, Peter Szolovits. Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Aaron Sonabend W.

This author has not been identified. Look up 'Aaron Sonabend W.' in Google

Junwei Lu

This author has not been identified. Look up 'Junwei Lu' in Google

Leo Anthony Celi

This author has not been identified. Look up 'Leo Anthony Celi' in Google

Tianxi Cai

This author has not been identified. Look up 'Tianxi Cai' in Google

Peter Szolovits

This author has not been identified. Look up 'Peter Szolovits' in Google