Number of Attention Heads vs. Number of Transformer-encoders in Computer Vision

Tomas Hrycej, Bernhard Bermeitinger, Siegfried Handschuh. Number of Attention Heads vs. Number of Transformer-encoders in Computer Vision. In Frans Coenen, Ana L. N. Fred, Joaquim Filipe, editors, Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2022, Volume 1: KDIR, Valletta, Malta, October 24-26, 2022, pages 315-321. SCITEPRESS, 2022.
