Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning

Jintao Meng, Chen Zhuang, Peng Chen, Mohamed Wahib, Bertil Schmidt, Xiao Wang, Haidong Lan, Dou Wu, Minwen Deng, Yanjie Wei, Shengzhong Feng. Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning. IEEE Trans. Parallel Distrib. Syst., 33(11):2885-2899, 2022. [doi]

Authors

Jintao Meng

This author has not been identified. Look up 'Jintao Meng' in Google

Chen Zhuang

This author has not been identified. Look up 'Chen Zhuang' in Google

Peng Chen

This author has not been identified. Look up 'Peng Chen' in Google

Mohamed Wahib

This author has not been identified. Look up 'Mohamed Wahib' in Google

Bertil Schmidt

This author has not been identified. Look up 'Bertil Schmidt' in Google

Xiao Wang

This author has not been identified. Look up 'Xiao Wang' in Google

Haidong Lan

This author has not been identified. Look up 'Haidong Lan' in Google

Dou Wu

This author has not been identified. Look up 'Dou Wu' in Google

Minwen Deng

This author has not been identified. Look up 'Minwen Deng' in Google

Yanjie Wei

This author has not been identified. Look up 'Yanjie Wei' in Google

Shengzhong Feng

This author has not been identified. Look up 'Shengzhong Feng' in Google