Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning

Georgios Kotsalis, Guanghui Lan, Tianjiao Li. Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning. SIAM Journal on Optimization, 32(2):1120-1155, 2022. [doi]

Abstract

Abstract is missing.