End-to-End Video Object Detection with Spatial-Temporal Transformers

Lu He, Qianyu Zhou 0001, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang. End-to-End Video Object Detection with Spatial-Temporal Transformers. In Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo Cesar, Florian Metze, Balakrishnan Prabhakaran, editors, MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021. pages 1507-1516, ACM, 2021. [doi]

Abstract

Abstract is missing.