Minimax Off-Policy Evaluation for Multi-Armed Bandits

Cong Ma, Banghua Zhu, Jiantao Jiao, Martin J. Wainwright. Minimax Off-Policy Evaluation for Multi-Armed Bandits. IEEE Transactions on Information Theory, 68(8):5314-5339, 2022. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.