Convolutional Embedding Makes Hierarchical Vision Transformer Stronger

Cong Wang, Hongmin Xu, Xiong Zhang, Li Wang, Zhitong Zheng, Haifeng Liu. Convolutional Embedding Makes Hierarchical Vision Transformer Stronger. In Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner, editors, Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XX. Volume 13680 of Lecture Notes in Computer Science, pages 739-756, Springer, 2022. [doi]

Abstract

Abstract is missing.