VideoXum: Cross-Modal Visual and Textural Summarization of Videos

Jingyang Lin, Hang Hua, Ming Chen, Yikang Li, Jenhao Hsiao, Chiuman Ho, Jiebo Luo. VideoXum: Cross-Modal Visual and Textural Summarization of Videos. IEEE Transactions on Multimedia, 26:5548-5560, 2024.
