NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning

Sirui Xie, Junning Huang, Lanxin Lei, Chunxiao Liu, Zheng Ma, Wei Zhang 0114, Liang Lin. NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. [doi]

Abstract

Abstract is missing.