Contextual Bandits and Imitation Learning with Preference-Based Active Queries

Ayush Sekhari, Karthik Sridharan, Wen Sun 0002, Runzhe Wu. Contextual Bandits and Imitation Learning with Preference-Based Active Queries. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10-16, 2023.

Authors

Ayush Sekhari

Karthik Sridharan

Wen Sun 0002

Runzhe Wu
