Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs

Yufan Xu, Qiwei Yuan, Erik Curtis Barton, Rui Li 0033, P. Sadayappan, Aravind Sukumaran-Rajam. Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. pages 252-264, ACM, 2022. [doi]

Abstract

Abstract is missing.