LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling

Kaijing Ma, Xianghao Zang, Zerun Feng, Han Fang, Chao Ban, Yuhan Wei, Zhongjiang He, Yongxiang Li, Hao Sun. LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling. In IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023. pages 2790-2795, IEEE, 2023.
