The following publications are possibly variants of this publication:
- SViTT: Temporal Learning of Sparse Video-Text Transformers. Yi Li, Kyle Min 0001, Subarna Tripathi, Nuno Vasconcelos. cvpr 2023: 18919-18929
- Temporally Efficient Vision Transformer for Video Instance Segmentation. Shusheng Yang, Xinggang Wang, Yu Li 0003, Yuxin Fang, Jiemin Fang, Wenyu Liu 0001, Xun Zhao, Ying Shan. cvpr 2022: 2875-2885
- Temporally Distributed Networks for Fast Video Semantic Segmentation. Ping Hu, Fabian Caba, Oliver Wang, Zhe Lin, Stan Sclaroff, Federico Perazzi. cvpr 2020: 8815-8824
- Capturing the spatio-temporal continuity for video semantic segmentation. Xin Chen, Aming Wu, Yahong Han. iet-ipr, 13(14):2813-2820, 2019.
- Temporal Memory Attention for Video Semantic Segmentation. Hao Wang, Weining Wang, Jing Liu 0001. icip 2021: 2254-2258