TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition

Lizhi Xiang, Miao Yin, Chengming Zhang 0006, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan 0001, Dingwen Tao. TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition. In Maryam Mehri Dehnavi, Milind Kulkarni 0001, Sriram Krishnamoorthy, editors, Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2023, Montreal, QC, Canada, 25 February 2023 - 1 March 2023. pages 260-273, ACM, 2023.

Authors

Lizhi Xiang

Miao Yin

Chengming Zhang 0006

Aravind Sukumaran-Rajam

P. Sadayappan

Bo Yuan 0001

Dingwen Tao
