Energon: Toward Efficient Acceleration of Transformers Using Dynamic Sparse Attention

Zhe Zhou, Junlin Liu, Zhenyu Gu, Guangyu Sun 0003. Energon: Toward Efficient Acceleration of Transformers Using Dynamic Sparse Attention. IEEE Trans. on CAD of Integrated Circuits and Systems, 42(1):136-149, 2023. [doi]

Abstract

Abstract is missing.