NIOT: A Novel Inference Optimization of Transformers on Modern CPUs

Zining Zhang, Yao Chen, Bingsheng He, Zhenjie Zhang. NIOT: A Novel Inference Optimization of Transformers on Modern CPUs. IEEE Trans. Parallel Distrib. Syst., 34(6):1982-1995, June 2023.
