Quantized Transformer Language Model Implementations on Edge Devices

Mohammad Wali Ur Rahman, Murad Mehrab Abrar, Hunter Gibbons Copening, Salim Hariri, Sicong Shao, Pratik Satam, Soheil Salehi. Quantized Transformer Language Model Implementations on Edge Devices. In International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, FL, USA, December 15-17, 2023. pages 709-716, IEEE, 2023.
