Teachable Reinforcement Learning via Advice Distillation

Olivia Watkins, Abhishek Gupta 0004, Trevor Darrell, Pieter Abbeel, Jacob Andreas. Teachable Reinforcement Learning via Advice Distillation. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 6920-6933, 2021. [doi]

Authors

Olivia Watkins

This author has not been identified. Look up 'Olivia Watkins' in Google

Abhishek Gupta 0004

This author has not been identified. Look up 'Abhishek Gupta 0004' in Google

Trevor Darrell

This author has not been identified. Look up 'Trevor Darrell' in Google

Pieter Abbeel

This author has not been identified. Look up 'Pieter Abbeel' in Google

Jacob Andreas

This author has not been identified. Look up 'Jacob Andreas' in Google