Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence

Weidong Chen, Guorong Li, Xinfeng Zhang, Hongyang Yu, Shuhui Wang, Qingming Huang. Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence. In Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo Cesar, Florian Metze, Balakrishnan Prabhakaran, editors, MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021. pages 4053-4062, ACM, 2021. [doi]

Abstract

Abstract is missing.