OPTIMUS: OPTImized matrix MUltiplication Structure for Transformer neural network accelerator

Junki Park, Hyunsung Yoon, Daehyun Ahn, Jungwook Choi, Jae-Joon Kim. OPTIMUS: OPTImized matrix MUltiplication Structure for Transformer neural network accelerator. In Inderjit S. Dhillon, Dimitris S. Papailiopoulos, Vivienne Sze, editors, Proceedings of Machine Learning and Systems 2020, MLSys 2020, Austin, TX, USA, March 2-4, 2020. mlsys.org, 2020. [doi]

Abstract

Abstract is missing.