Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Jiaqing Cao, Quan Liu, Lan Wu, Qiming Fu 0001, Shan Zhong. Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control. Appl. Intell., 53(18):20917-20937, September 2023. [doi]
Abstract is missing.