Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov 0003, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani 0001, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [doi]
Abstract is missing.