The following publications may be variants of this publication:
- PVT-Crowd: Bridging Multi-scale Features from Pyramid Vision Transformer for Weakly-Supervised Crowd Counting. Zhanqiang Huo, Kunwei Zhang, Fen Luo, Yingxu Qiao. PRCV 2024: 306-318
- RGB-T Multi-Modal Crowd Counting Based on Transformer. Zhengyi Liu, Wei Wu, Yacheng Tan, Guanghui Zhang. BMVC 2022: 427
- U-shaped network based on Transformer for 3D point clouds semantic segmentation. Jiazhe Zhang, Xingwei Li, Xianfa Zhao, Yizhi Ge, Zheng Zhang. ICVIP 2021: 170-176
- Multi-scale Neighborhood Attention Transformer on U-Net for Medical Image Segmentation. Nanxing Zhang, Shiqiang Ma, Xuejian Li, Jiahui Zhang, Jijun Tang, Fei Guo 0001. BIBM 2022: 1381-1386