Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning

Jingfeng Wu, Vladimir Braverman, Lin Yang 0011. Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 13112-13124, 2021. [doi]

Authors

Jingfeng Wu

This author has not been identified. Look up 'Jingfeng Wu' in Google

Vladimir Braverman

This author has not been identified. Look up 'Vladimir Braverman' in Google

Lin Yang 0011

This author has not been identified. Look up 'Lin Yang 0011' in Google