Textual supervision for visually grounded spoken language understanding

Bertrand Higy, Desmond Elliott, Grzegorz Chrupala. Textual supervision for visually grounded spoken language understanding. In Trevor Cohn, Yulan He, Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, EMNLP 2020, Online Event, 16-20 November 2020, pages 2698-2709. Association for Computational Linguistics, 2020.

@inproceedings{HigyEC20,
  title = {Textual supervision for visually grounded spoken language understanding},
  author = {Bertrand Higy and Desmond Elliott and Grzegorz Chrupala},
  year = {2020},
  url = {https://www.aclweb.org/anthology/2020.findings-emnlp.244/},
  researchr = {https://researchr.org/publication/HigyEC20},
  cites = {0},
  citedby = {0},
  pages = {2698--2709},
  booktitle = {Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, EMNLP 2020, Online Event, 16-20 November 2020},
  editor = {Trevor Cohn and Yulan He and Yang Liu},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-952148-90-3},
}