VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding

Hu Xu, Gargi Ghosh, Po-Yao Huang 0001, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer. VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih, editors, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021. pages 6787-6800, Association for Computational Linguistics, 2021. [doi]

Authors

Hu Xu

This author has not been identified. Look up 'Hu Xu' in Google

Gargi Ghosh

This author has not been identified. Look up 'Gargi Ghosh' in Google

Po-Yao Huang 0001

This author has not been identified. Look up 'Po-Yao Huang 0001' in Google

Dmytro Okhonko

This author has not been identified. Look up 'Dmytro Okhonko' in Google

Armen Aghajanyan

This author has not been identified. Look up 'Armen Aghajanyan' in Google

Florian Metze

This author has not been identified. Look up 'Florian Metze' in Google

Luke Zettlemoyer

This author has not been identified. Look up 'Luke Zettlemoyer' in Google

Christoph Feichtenhofer

This author has not been identified. Look up 'Christoph Feichtenhofer' in Google