An Efficient Transformer Inference Engine on DSP

Kangkang Chen, Huayou Su, Chaorun Liu, Xiaoli Gong. An Efficient Transformer Inference Engine on DSP. In Weizhi Meng 0001, Rongxing Lu, Geyong Min, Jaideep Vaidya, editors, Algorithms and Architectures for Parallel Processing - 22nd International Conference, ICA3PP 2022, Copenhagen, Denmark, October 10-12, 2022, Proceedings. Volume 13777 of Lecture Notes in Computer Science, pages 548-567, Springer, 2022. [doi]

Abstract

Abstract is missing.