TVQA: Localized, Compositional Video Question Answering

Jie Lei, Licheng Yu, Mohit Bansal, Tamara L. Berg. TVQA: Localized, Compositional Video Question Answering. In Ellen Riloff, David Chiang 0001, Julia Hockenmaier, Jun'ichi Tsujii, editors, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018. pages 1369-1379, Association for Computational Linguistics, 2018. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: