Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards

Yijie Guo, Jongwook Choi, Marcin Moczulski, Shengyu Feng, Samy Bengio, Mohammad Norouzi 0002, Honglak Lee. Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Yijie Guo

This author has not been identified. Look up 'Yijie Guo' in Google

Jongwook Choi

This author has not been identified. Look up 'Jongwook Choi' in Google

Marcin Moczulski

This author has not been identified. Look up 'Marcin Moczulski' in Google

Shengyu Feng

This author has not been identified. Look up 'Shengyu Feng' in Google

Samy Bengio

This author has not been identified. Look up 'Samy Bengio' in Google

Mohammad Norouzi 0002

This author has not been identified. Look up 'Mohammad Norouzi 0002' in Google

Honglak Lee

This author has not been identified. Look up 'Honglak Lee' in Google