High-Confidence Off-Policy (or Counterfactual) Variance Estimation

Yash Chandak, Shiv Shankar, Philip S. Thomas. High-Confidence Off-Policy (or Counterfactual) Variance Estimation. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. pages 6939-6947, AAAI Press, 2021. [doi]

Authors

Yash Chandak

This author has not been identified. Look up 'Yash Chandak' in Google

Shiv Shankar

This author has not been identified. Look up 'Shiv Shankar' in Google

Philip S. Thomas

This author has not been identified. Look up 'Philip S. Thomas' in Google