ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh 0001, Tom Zahavy. ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 25303-25336, PMLR, 2023. [doi]

Authors

Ted Moskovitz

This author has not been identified. Look up 'Ted Moskovitz' in Google

Brendan O'Donoghue

This author has not been identified. Look up 'Brendan O'Donoghue' in Google

Vivek Veeriah

This author has not been identified. Look up 'Vivek Veeriah' in Google

Sebastian Flennerhag

This author has not been identified. Look up 'Sebastian Flennerhag' in Google

Satinder Singh 0001

This author has not been identified. Look up 'Satinder Singh 0001' in Google

Tom Zahavy

This author has not been identified. Look up 'Tom Zahavy' in Google