Online Reinforcement Learning for Mixed Policy Scopes

Junzhe Zhang, Elias Bareinboim. Online Reinforcement Learning for Mixed Policy Scopes. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022. [doi]

Authors

Junzhe Zhang

This author has not been identified. Look up 'Junzhe Zhang' in Google

Elias Bareinboim

This author has not been identified. Look up 'Elias Bareinboim' in Google