Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition

Tiancheng Jin, Haipeng Luo. Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Tiancheng Jin

This author has not been identified. Look up 'Tiancheng Jin' in Google

Haipeng Luo

This author has not been identified. Look up 'Haipeng Luo' in Google