Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

Shuang Qiu, Xiaohan Wei, Zhuoran Yang, Jieping Ye, Zhaoran Wang. Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020.

Authors

Shuang Qiu

Xiaohan Wei

Zhuoran Yang

Jieping Ye

Zhaoran Wang
