The following publications are possibly variants of this publication:
- Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce. Yanhao Zhang, Qiang Wang, Pan Pan, Yun Zheng, Cheng Da, Siyang Sun, Yinghui Xu. AAAI 2021: 16127-16128.
- UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection. Ye Liu, Siyuan Li, Yang Wu, Chang Wen Chen, Ying Shan, Xiaohu Qie. CVPR 2022: 3032-3041.
- User preference-aware video highlight detection via deep reinforcement learning. Han Wang, Kexin Wang, Yuqing Wu, ZhongZhi Wang, Ling Zou. Multimedia Tools and Applications, 79(21-22):15015-15024, 2020.
- Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies. Bei Gan, Xiujun Shu, Ruizhi Qiao, Haoqian Wu, Keyu Chen, Hanjun Li, Bo Ren 0002. CVPR 2023: 18898-18907.