Teachable Reinforcement Learning via Advice Distillation

Olivia Watkins, Abhishek Gupta 0004, Trevor Darrell, Pieter Abbeel, Jacob Andreas. Teachable Reinforcement Learning via Advice Distillation. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 6920-6933, 2021. [doi]

@inproceedings{WatkinsGDAA21,
  title = {Teachable Reinforcement Learning via Advice Distillation},
  author = {Olivia Watkins and Abhishek Gupta 0004 and Trevor Darrell and Pieter Abbeel and Jacob Andreas},
  year = {2021},
  url = {https://proceedings.neurips.cc/paper/2021/hash/37cfff3c04f95b22bcf166df586cd7a9-Abstract.html},
  researchr = {https://researchr.org/publication/WatkinsGDAA21},
  cites = {0},
  citedby = {0},
  pages = {6920-6933},
  booktitle = {Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual},
  editor = {Marc'Aurelio Ranzato and Alina Beygelzimer and Yann N. Dauphin and Percy Liang and Jennifer Wortman Vaughan},
}