NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning

Sirui Xie, Junning Huang, Lanxin Lei, Chunxiao Liu, Zheng Ma, Wei Zhang 0114, Liang Lin. NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. [doi]

@inproceedings{XieHLLMZL19,
  title = {NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning},
  author = {Sirui Xie and Junning Huang and Lanxin Lei and Chunxiao Liu and Zheng Ma and Wei Zhang 0114 and Liang Lin},
  year = {2019},
  url = {https://openreview.net/forum?id=rkxciiC9tm},
  researchr = {https://researchr.org/publication/XieHLLMZL19},
  cites = {0},
  citedby = {0},
  booktitle = {7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019},
  publisher = {OpenReview.net},
}