Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs

Yufan Xu, Qiwei Yuan, Erik Curtis Barton, Rui Li 0033, P. Sadayappan, Aravind Sukumaran-Rajam. Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. pages 252-264, ACM, 2022. [doi]

Authors

Yufan Xu

This author has not been identified. Look up 'Yufan Xu' in Google

Qiwei Yuan

This author has not been identified. Look up 'Qiwei Yuan' in Google

Erik Curtis Barton

This author has not been identified. Look up 'Erik Curtis Barton' in Google

Rui Li 0033

This author has not been identified. Look up 'Rui Li 0033' in Google

P. Sadayappan

This author has not been identified. It may be one of the following persons: Look up 'P. Sadayappan' in Google

Aravind Sukumaran-Rajam

This author has not been identified. Look up 'Aravind Sukumaran-Rajam' in Google