Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network

Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan. Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network. IEEE Transactions on Neural Networks and Learning Systems, 31(10):4374-4380, 2020.
