Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan 0001, Michael Felsberg, Fahad Shahbaz Khan. Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer. In Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner, editors, Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXIX. Volume 13689 of Lecture Notes in Computer Science, pages 666-681, Springer, 2022. [doi]