Subhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak. SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. Pages 36531-36576, OpenReview.net, 2024.