Scalable Safe Policy Improvement via Monte Carlo Tree Search

Alberto Castellini, Federico Bianchi 0002, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T. J. Spaan. Scalable Safe Policy Improvement via Monte Carlo Tree Search. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 3732-3756, PMLR, 2023. [doi]

Authors

Alberto Castellini

This author has not been identified. Look up 'Alberto Castellini' in Google

Federico Bianchi 0002

This author has not been identified. Look up 'Federico Bianchi 0002' in Google

Edoardo Zorzi

This author has not been identified. Look up 'Edoardo Zorzi' in Google

Thiago D. Simão

This author has not been identified. Look up 'Thiago D. Simão' in Google

Alessandro Farinelli

This author has not been identified. Look up 'Alessandro Farinelli' in Google

Matthijs T. J. Spaan

This author has not been identified. Look up 'Matthijs T. J. Spaan' in Google