Cross-language multimodal scene semantic guidance and leap sampling for video captioning

Bo Sun 0006, Yong Wu, Yijia Zhao, Zhuo Hao, Lejun Yu, Jun He 0009. Cross-language multimodal scene semantic guidance and leap sampling for video captioning. The Visual Computer, 39(1):9-25, 2023.
