The following publications are possibly variants of this publication:
- TSPTQ-ViT: Two-Scaled Post-Training Quantization for Vision TransformerYu-Shan Tai, Ming-Guang Lin, An-Yeu Andy Wu. icassp 2023: 1-5 [doi]
- Pyramid Adversarial Training Improves ViT PerformanceCharles Herrmann, Kyle Sargent, Lu Jiang, Ramin Zabih, Huiwen Chang, Ce Liu, Dilip Krishnan, Deqing Sun. cvpr 2022: 13409-13419 [doi]
- Vits-Based Singing Voice Conversion Leveraging Whisper and Multi-Scale F0 ModelingZiqian Ning, Yuepeng Jiang, Zhichao Wang, Bin Zhang, Lei Xie 0001. asru 2023: 1-8 [doi]
- A Finger Vein Liveness Detection System Based on Multi-Scale Spatial-Temporal Map and Light-ViT ModelLiukui Chen, Tengwen Guo, Li Li, Haiyang Jiang, Wenfu Luo, Zuojin Li. sensors, 23(24):9637, December 2023. [doi]
- AiluRus: A Scalable ViT Framework for Dense PredictionJin Li, Yaoming Wang, Xiaopeng Zhang, Bowen Shi, Dongsheng Jiang, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian. nips 2023: [doi]