Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition

Syed Talal Wasim, Muhammad Uzair Khattak, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan. Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 13732-13743, IEEE, 2023. [doi]

Abstract

Abstract is missing.