Interpretable ML enhanced CNN Performance Analysis of cuBLAS, cuDNN and TensorRT

Zhumakhan Nazir, Vladislav Yarovenko, Jurn-Gyu Park. Interpretable ML enhanced CNN Performance Analysis of cuBLAS, cuDNN and TensorRT. In Jiman Hong, Maart Lanperne, Juw Won Park, Tomás Cerný, Hossain Shahriar, editors, Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, SAC 2023, Tallinn, Estonia, March 27-31, 2023. pages 1260-1265, ACM, 2023. [doi]

Abstract

Abstract is missing.