Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs

Partha Maji, Andrew Mundy, Ganesh Dasika, Jesse G. Beu, Matthew Mattina, Robert D. Mullins. Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs. In 2nd Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications, EMC2@HPCA 2019, Washington, DC, USA, February 17, 2019. Pages 1-5, IEEE, 2019.

Authors

Partha Maji

Andrew Mundy

Ganesh Dasika

Jesse G. Beu

Matthew Mattina

Robert D. Mullins
