Generalized gradient emphasis learning for off-policy evaluation and control with function approximation

Jiaqing Cao, Quan Liu, Lan Wu, Qiming Fu 0001, Shan Zhong. Generalized gradient emphasis learning for off-policy evaluation and control with function approximation. Neural Computing and Applications, 35(32):23599-23616, November 2023. [doi]

Abstract

Abstract is missing.