Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?

Mitja Nikolaus, Emmanuelle Salin, Stéphane Ayache, Abdellah Fourtassi, Benoît Favre. Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies? In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, pages 1538-1555. Association for Computational Linguistics, 2022.

@inproceedings{NikolausSAFF22,
  title = {Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?},
  author = {Mitja Nikolaus and Emmanuelle Salin and Stéphane Ayache and Abdellah Fourtassi and Benoît Favre},
  year = {2022},
  url = {https://aclanthology.org/2022.emnlp-main.100},
  researchr = {https://researchr.org/publication/NikolausSAFF22},
  cites = {0},
  citedby = {0},
  pages = {1538--1555},
  booktitle = {Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11},
  editor = {Yoav Goldberg and Zornitsa Kozareva and Yue Zhang},
  publisher = {Association for Computational Linguistics},
}