The following publications are possibly variants of this publication:
- LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning. Xi Chen, Ali Ghadirzadeh, Tianhe Yu, Jianhao Wang, Alex Yuan Gao, Wenzhe Li, Liang Bin, Chelsea Finn, Chongjie Zhang. NeurIPS 2022.
- Offline Meta-Reinforcement Learning with Online Self-Supervision. Vitchyr H. Pong, Ashvin V. Nair, Laura M. Smith, Catherine Huang, Sergey Levine. ICML 2022: 17811-17829.
- Offline Meta-Reinforcement Learning for Industrial Insertion. Tony Z. Zhao, Jianlan Luo, Oleg Sushkov, Rugile Pevceviciute, Nicolas Heess, Jon Scholz, Stefan Schaal, Sergey Levine. ICRA 2022: 6386-6393.
- Context Shift Reduction for Offline Meta-Reinforcement Learning. Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen. NeurIPS 2023.