A 28nm 64.5TOPS/W Sparse Transformer Accelerator with Partial Product-based Speculation and Sparsity-Adaptive Computation

Ming-Guang Lin, Jiing-Ping Wang, Yuan-June Luo, An-Yeu Andy Wu. A 28nm 64.5TOPS/W Sparse Transformer Accelerator with Partial Product-based Speculation and Sparsity-Adaptive Computation. In IEEE Asia Pacific Conference on Circuits and Systems, APCCAS 2024, Taipei, Taiwan, November 7-9, 2024. pages 664-668, IEEE, 2024. [doi]

Abstract

Abstract is missing.