Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Zhizhou Ren, Ruihan Guo, Yuan Zhou 0007, Jian Peng 0001. Learning Long-Term Reward Redistribution via Randomized Return Decomposition. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Authors

Zhizhou Ren

This author has not been identified. Look up 'Zhizhou Ren' in Google

Ruihan Guo

This author has not been identified. Look up 'Ruihan Guo' in Google

Yuan Zhou 0007

This author has not been identified. Look up 'Yuan Zhou 0007' in Google

Jian Peng 0001

This author has not been identified. Look up 'Jian Peng 0001' in Google