NIOT: A Novel Inference Optimization of Transformers on Modern CPUs

Zining Zhang, Yao Chen, Bingsheng He, Zhenjie Zhang. NIOT: A Novel Inference Optimization of Transformers on Modern CPUs. IEEE Trans. Parallel Distrib. Syst., 34(6):1982-1995, June 2023.
