Generalized gradient emphasis learning for off-policy evaluation and control with function approximation

Jiaqing Cao, Quan Liu, Lan Wu, Qiming Fu 0001, Shan Zhong. Generalized gradient emphasis learning for off-policy evaluation and control with function approximation. Neural Computing and Applications, 35(32):23599-23616, November 2023. [doi]

Authors

Jiaqing Cao

This author has not been identified. Look up 'Jiaqing Cao' in Google

Quan Liu

This author has not been identified. Look up 'Quan Liu' in Google

Lan Wu

This author has not been identified. Look up 'Lan Wu' in Google

Qiming Fu 0001

This author has not been identified. Look up 'Qiming Fu 0001' in Google

Shan Zhong

This author has not been identified. Look up 'Shan Zhong' in Google