Minimax Off-Policy Evaluation for Multi-Armed Bandits

Cong Ma, Banghua Zhu, Jiantao Jiao, Martin J. Wainwright. Minimax Off-Policy Evaluation for Multi-Armed Bandits. IEEE Transactions on Information Theory, 68(8):5314-5339, 2022. [doi]

Authors

Cong Ma

This author has not been identified. Look up 'Cong Ma' in Google

Banghua Zhu

This author has not been identified. Look up 'Banghua Zhu' in Google

Jiantao Jiao

This author has not been identified. Look up 'Jiantao Jiao' in Google

Martin J. Wainwright

This author has not been identified. Look up 'Martin J. Wainwright' in Google