Hierarchical Conditional Relation Networks for Multimodal Video Question Answering

Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran. Hierarchical Conditional Relation Networks for Multimodal Video Question Answering. International Journal of Computer Vision, 129(11):3027-3050, 2021.
