Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models

Ali Hadi Zadeh, Mostafa Mahmoud, Ameer Abdelhadi, Andreas Moshovos. Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models. In Valentina Salapura, Mohamed Zahran 0001, Fred Chong, Lingjia Tang, editors, ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18 - 22, 2022. pages 888-901, ACM, 2022. [doi]

Abstract

Abstract is missing.