Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation

Li Gao, Dong Nie, Bo Li, Xiaofeng Ren. Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation. In Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner, editors, Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXIII. Volume 13683 of Lecture Notes in Computer Science, pages 744-761, Springer, 2022. [doi]

Authors

Li Gao

This author has not been identified. Look up 'Li Gao' in Google

Dong Nie

This author has not been identified. Look up 'Dong Nie' in Google

Bo Li

This author has not been identified. Look up 'Bo Li' in Google

Xiaofeng Ren

This author has not been identified. Look up 'Xiaofeng Ren' in Google