Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Zhizhou Ren, Ruihan Guo, Yuan Zhou 0007, Jian Peng 0001. Learning Long-Term Reward Redistribution via Randomized Return Decomposition. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

@inproceedings{RenG0022,
  title = {Learning Long-Term Reward Redistribution via Randomized Return Decomposition},
  author = {Zhizhou Ren and Ruihan Guo and Yuan Zhou 0007 and Jian Peng 0001},
  year = {2022},
  url = {https://openreview.net/forum?id=lpkGn3k2YdD},
  researchr = {https://researchr.org/publication/RenG0022},
  cites = {0},
  citedby = {0},
  booktitle = {The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022},
  publisher = {OpenReview.net},
}