The following publications are possibly variants of this publication:
- Structure-Aware Cross-Modal Transformer for Depth CompletionLinqing Zhao, Yi Wei, Jiaxin Li, Jie Zhou 0001, Jiwen Lu. TIP, 33:1016-1031, 2024. [doi]
- Efficient Visual Tracking via Hierarchical Cross-Attention TransformerXin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu. eccv 2023: 461-477 [doi]
- Transformer-based Monocular Depth Estimation with Attention SupervisionWenjie Chang, Yueyi Zhang, Zhiwei Xiong. bmvc 2021: 136 [doi]
- CATNet: Convolutional attention and transformer for monocular depth estimationShuai Tang, Tongwei Lu, Xuanxuan Liu, Huabing Zhou, Yanduo Zhang. PR, 145:109982, January 2024. [doi]