Zahra Vaseqi, Pengnan Fan, James Clark, Martin Levine. A Framework for Video-Text Retrieval with Noisy Supervision. In Raj Tumuluri, Nicu Sebe, Gopal Pingali, Dinesh Babu Jayagopi, Abhinav Dhall, Richa Singh 0001, Lisa Anthony, Albert Ali Salah, editors, International Conference on Multimodal Interaction, ICMI 2022, Bengaluru, India, November 7-11, 2022. pages 373-383, ACM, 2022. [doi]
Abstract is missing.