Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA

Hyounghun Kim, Zineng Tang, Mohit Bansal. Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 4812-4822, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.