Transformer in Transformer

Kai Han, An Xiao, Enhua Wu, Jianyuan Guo, Chunjing Xu, Yunhe Wang. Transformer in Transformer. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. Pages 15908-15919, 2021.

@inproceedings{HanXWGXW21,
  title = {Transformer in Transformer},
  author = {Kai Han and An Xiao and Enhua Wu and Jianyuan Guo and Chunjing Xu and Yunhe Wang},
  year = {2021},
  url = {https://proceedings.neurips.cc/paper/2021/hash/854d9fca60b4bd07f9bb215d59ef5561-Abstract.html},
  researchr = {https://researchr.org/publication/HanXWGXW21},
  pages = {15908--15919},
  booktitle = {Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual},
  editor = {Marc'Aurelio Ranzato and Alina Beygelzimer and Yann N. Dauphin and Percy Liang and Jennifer Wortman Vaughan},
}