Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network

Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan 0001. Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network. IEEE Transactions on Neural Networks and Learning Systems, 31(10):4374-4380, 2020.

Authors

Wenjia Meng

Qian Zheng

Long Yang

Pengfei Li

Gang Pan 0001
