MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation

Najmeh Sadoughi, Xinyu Li, Avijit Vajpayee, David Fan, Bing Shuai, Hector J. Santos-Villalobos, Vimal Bhat, Rohith MV. MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 23274-23283, IEEE, 2023. [doi]

Abstract

Abstract is missing.