Text-Guided Object Detector for Multi-modal Video Question Answering

Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda. Text-Guided Object Detector for Multi-modal Video Question Answering. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, Waikoloa, HI, USA, January 2-7, 2023. Pages 1032-1042, IEEE, 2023.

Authors

Ruoyue Shen

Nakamasa Inoue

Koichi Shinoda
