Learning Rewards From Linguistic Feedback

Theodore R. Sumers, Mark K. Ho, Robert X. D. Hawkins, Karthik Narasimhan, Thomas L. Griffiths. Learning Rewards From Linguistic Feedback. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. Pages 6002-6010, AAAI Press, 2021.

@inproceedings{SumersHHNG21,
  title = {Learning Rewards From Linguistic Feedback},
  author = {Theodore R. Sumers and Mark K. Ho and Robert X. D. Hawkins and Karthik Narasimhan and Thomas L. Griffiths},
  year = {2021},
  url = {https://ojs.aaai.org/index.php/AAAI/article/view/16749},
  researchr = {https://researchr.org/publication/SumersHHNG21},
  cites = {0},
  citedby = {0},
  pages = {6002--6010},
  booktitle = {Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021},
  publisher = {AAAI Press},
  isbn = {978-1-57735-866-4},
}