Guorong Li, Hanhua Ye, Yuankai Qi, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang. Learning Hierarchical Modular Networks for Video Captioning. IEEE Trans. Pattern Anal. Mach. Intell., 46(2):1049-1064, February 2024.