Survey: Transformer based video-language pre-training

Ludan Ruan, Qin Jin. Survey: Transformer based video-language pre-training. AI Open, 3:1-13, January 2022.
