Block Policy Mirror Descent

Guanghui Lan, Yan Li 0074, Tuo Zhao. Block Policy Mirror Descent. SIAM Journal on Optimization, 33(3):2341-2378, September 2023. [doi]

Abstract

Abstract is missing.