NIOT: A Novel Inference Optimization of Transformers on Modern CPUs

Zining Zhang 0001, Yao Chen, Bingsheng He, Zhenjie Zhang. NIOT: A Novel Inference Optimization of Transformers on Modern CPUs. IEEE Trans. Parallel Distrib. Syst., 34(6):1982-1995, June 2023. [doi]

Authors

Zining Zhang 0001

This author has not been identified. Look up 'Zining Zhang 0001' in Google

Yao Chen

This author has not been identified. Look up 'Yao Chen' in Google

Bingsheng He

This author has not been identified. Look up 'Bingsheng He' in Google

Zhenjie Zhang

This author has not been identified. Look up 'Zhenjie Zhang' in Google