Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Sernam Lim. Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding. In IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023. pages 1181-1186, IEEE, 2023. [doi]

Abstract

Abstract is missing.