VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning

Jiayi Guan, Guang Chen 0001, Jiaming Ji, Long Yang, Ao Zhou, Zhijun Li, Changjun Jiang. VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Jiayi Guan

This author has not been identified. Look up 'Jiayi Guan' in Google

Guang Chen 0001

This author has not been identified. Look up 'Guang Chen 0001' in Google

Jiaming Ji

This author has not been identified. Look up 'Jiaming Ji' in Google

Long Yang

This author has not been identified. Look up 'Long Yang' in Google

Ao Zhou

This author has not been identified. Look up 'Ao Zhou' in Google

Zhijun Li

This author has not been identified. Look up 'Zhijun Li' in Google

Changjun Jiang

This author has not been identified. Look up 'Changjun Jiang' in Google