Accelerating Sparse Attention with a Reconfigurable Non-volatile Processing-In-Memory Architecture

Qilin Zheng, Shiyu Li, Yitu Wang, Ziru Li, Yiran Chen 0001, Hai Helen Li. Accelerating Sparse Attention with a Reconfigurable Non-volatile Processing-In-Memory Architecture. In 60th ACM/IEEE Design Automation Conference, DAC 2023, San Francisco, CA, USA, July 9-13, 2023. pages 1-6, IEEE, 2023. [doi]

Abstract

Abstract is missing.