Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs

Partha Maji, Andrew Mundy, Ganesh Dasika, Jesse G. Beu, Matthew Mattina, Robert D. Mullins. Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs. In 2nd Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications, EMC2@HPCA 2019, Washington, DC, USA, February 17, 2019. pages 1-5, IEEE, 2019. [doi]

Abstract

Abstract is missing.