MOPO: Model-based Offline Policy Optimization

Tianhe Yu, Garrett Thomas, Lantao Yu, Stefano Ermon, James Y. Zou, Sergey Levine, Chelsea Finn, Tengyu Ma. MOPO: Model-based Offline Policy Optimization. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Tianhe Yu

This author has not been identified. Look up 'Tianhe Yu' in Google

Garrett Thomas

This author has not been identified. Look up 'Garrett Thomas' in Google

Lantao Yu

This author has not been identified. Look up 'Lantao Yu' in Google

Stefano Ermon

This author has not been identified. Look up 'Stefano Ermon' in Google

James Y. Zou

This author has not been identified. Look up 'James Y. Zou' in Google

Sergey Levine

This author has not been identified. Look up 'Sergey Levine' in Google

Chelsea Finn

This author has not been identified. Look up 'Chelsea Finn' in Google

Tengyu Ma

This author has not been identified. Look up 'Tengyu Ma' in Google