The following publications are possibly variants of this publication:
- End-to-End Temporal Action Detection With Transformer. Xiaolong Liu, Qimeng Wang, Yao Hu, Xu Tang, Shiwei Zhang, Song Bai, Xiang Bai. TIP, 31:5427-5441, 2022.
- End-to-end video text detection with online tracking. Hongyuan Yu, Yan Huang 0008, Lihong Pi, Chengquan Zhang, Xuan Li, Liang Wang. PR, 113:107791, 2021.
- VADOI: Voice-Activity-Detection Overlapping Inference for End-To-End Long-Form Speech Recognition. Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas. ICASSP 2022: 6977-6981.
- Watch Only Once: An End-to-End Video Action Detection Framework. Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo 0002. ICCV 2021: 8158-8167.