InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao. InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 14408-14419, IEEE, 2023. [doi]

@inproceedings{WangDCHLZHLLLWQ23,
  title = {InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions},
  author = {Wenhai Wang and Jifeng Dai and Zhe Chen and Zhenhang Huang and Zhiqi Li and Xizhou Zhu and Xiaowei Hu and Tong Lu and Lewei Lu and Hongsheng Li and Xiaogang Wang and Yu Qiao},
  year = {2023},
  doi = {10.1109/CVPR52729.2023.01385},
  url = {https://doi.org/10.1109/CVPR52729.2023.01385},
  researchr = {https://researchr.org/publication/WangDCHLZHLLLWQ23},
  cites = {0},
  citedby = {0},
  pages = {14408-14419},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0129-8},
}