ATMNet: Adaptive Two-Stage Modular Network for Accurate Video Captioning

Tianyang Xu, Yunjie Zhang, Xiaoning Song, Zheng-Hua Feng, Xiao-Jun Wu 0001. ATMNet: Adaptive Two-Stage Modular Network for Accurate Video Captioning. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 55(4):2821-2833, April 2025. [doi]

Abstract

Abstract is missing.