ViViT: A Video Vision Transformer

Anurag Arnab, Mostafa Dehghani 0001, Georg Heigold, Chen Sun 0002, Mario Lucic, Cordelia Schmid. ViViT: A Video Vision Transformer. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. pages 6816-6826, IEEE, 2021. [doi]

Abstract

Abstract is missing.