The following publications are possibly variants of this publication:
- Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks. Zhou Zhao, Zhu Zhang, Shuwen Xiao, Zhou Yu, Jun Yu, Deng Cai, Fei Wu, Yueting Zhuang. IJCAI 2018: 3683-3689
- Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks. Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He. IJCAI 2019: 4383-4389
- Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks. Zhou Zhao, Zhu Zhang, Xinghua Jiang, Deng Cai. TIP, 28(8):3860-3872, 2019.
- Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks. Ting Yu, Jun Yu 0002, Zhou Yu, Qingming Huang, Qi Tian 0001. TCSVT, 31(3):931-944, 2021.
- Video Question Answering via Hierarchical Spatio-Temporal Attention Networks. Zhou Zhao, Qifan Yang, Deng Cai, Xiaofei He, Yueting Zhuang. IJCAI 2017: 3518-3524
- Video Question Answering via Hierarchical Dual-Level Attention Network Learning. Zhou Zhao, Jinghao Lin, Xinghua Jiang, Deng Cai, Xiaofei He, Yueting Zhuang. MM 2017: 1050-1058
- Video Question Answering via Attribute-Augmented Attention Network Learning. Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang. SIGIR 2017: 829-832
- Hierarchical Conditional Relation Networks for Video Question Answering. Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran 0001. CVPR 2020: 9969-9978
- Multi-Turn Video Question Answering via Multi-Stream Hierarchical Attention Context Network. Zhou Zhao, Xinghua Jiang, Deng Cai, Jun Xiao, Xiaofei He, Shiliang Pu. IJCAI 2018: 3690-3696
- Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering. Lianli Gao, Yu Lei, Pengpeng Zeng, Jingkuan Song, Meng Wang 0001, Heng Tao Shen. TIP, 31:202-215, 2022.
- Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering. Long Hoang Dang, Thao Minh Le, Vuong Le, Truyen Tran 0001. IJCAI 2021: 636-642
- Hierarchical Temporal Fusion of Multi-grained Attention Features for Video Question Answering. Shaoning Xiao, Yimeng Li, Yunan Ye, Long Chen 0016, Shiliang Pu, Zhou Zhao, Jian Shao, Jun Xiao 0001. NPL, 52(2):993-1003, 2020.