Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection

Khurram Azeem Hashmi, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal. Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, AZ, USA, February 26 - March 6, 2025. pages 8122-8133, IEEE, 2025.
