NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning

researchr

explore
calendar
search

You are not signed in
Sign in
Sign up

Sirui Xie, Junning Huang, Lanxin Lei, Chunxiao Liu, Zheng Ma, Wei Zhang 0114, Liang Lin. NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. [doi]

@inproceedings{XieHLLMZL19,
  title = {NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning},
  author = {Sirui Xie and Junning Huang and Lanxin Lei and Chunxiao Liu and Zheng Ma and Wei Zhang 0114 and Liang Lin},
  year = {2019},
  url = {https://openreview.net/forum?id=rkxciiC9tm},
  researchr = {https://researchr.org/publication/XieHLLMZL19},
  cites = {0},
  citedby = {0},
  booktitle = {7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019},
  publisher = {OpenReview.net},
}

External Links

Cite Key

Statistics

PDF

Researchr

NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning