VinVL+L: Enriching Visual Representation with Location Context in VQA

Jirí Vyskocil, Lukás Picek. VinVL+L: Enriching Visual Representation with Location Context in VQA. In Robert Sablatnig, Florian Kleber, editors, Proceedings of the 26th Computer Vision Winter Workshop (CVWW 2023), Krems a.d. Donau, Austria, February 15-17, 2023. Volume 3349 of CEUR Workshop Proceedings, CEUR-WS.org, 2023. [doi]

@inproceedings{VyskocilP23,
  title = {VinVL+L: Enriching Visual Representation with Location Context in VQA},
  author = {Jirí Vyskocil and Lukás Picek},
  year = {2023},
  url = {http://ceur-ws.org/Vol-3349/paper4.pdf},
  researchr = {https://researchr.org/publication/VyskocilP23},
  cites = {0},
  citedby = {0},
  booktitle = {Proceedings of the 26th Computer Vision Winter Workshop (CVWW 2023), Krems a.d. Donau, Austria, February 15-17, 2023},
  editor = {Robert Sablatnig and Florian Kleber},
  volume = {3349},
  series = {CEUR Workshop Proceedings},
  publisher = {CEUR-WS.org},
}