Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Prasenjit Karmakar, Shalabh Bhatnagar. Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning. Math. Oper. Res., 43(1):130-151, 2018. [doi]
Abstract is missing.