Text-Guided Object Detector for Multi-modal Video Question Answering

Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda. Text-Guided Object Detector for Multi-modal Video Question Answering. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, Waikoloa, HI, USA, January 2-7, 2023. pages 1032-1042, IEEE, 2023. [doi]

Abstract

Abstract is missing.