Defining admissible rewards for high-confidence policy evaluation in batch reinforcement learning

Niranjani Prasad, Barbara E. Engelhardt, Finale Doshi-Velez. Defining admissible rewards for high-confidence policy evaluation in batch reinforcement learning. In Marzyeh Ghassemi, editor, ACM CHIL '20: ACM Conference on Health, Inference, and Learning, Toronto, Ontario, Canada, April 2-4, 2020 [delayed]. pages 1-9, ACM, 2020. [doi]

Authors

Niranjani Prasad

This author has not been identified. Look up 'Niranjani Prasad' in Google

Barbara E. Engelhardt

This author has not been identified. Look up 'Barbara E. Engelhardt' in Google

Finale Doshi-Velez

This author has not been identified. Look up 'Finale Doshi-Velez' in Google