Transformer in Transformer

Kai Han 0002, An Xiao, Enhua Wu, Jianyuan Guo, Chunjing Xu, Yunhe Wang 0001. Transformer in Transformer. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 15908-15919, 2021. [doi]

Abstract

Abstract is missing.