Interpretable ML enhanced CNN Performance Analysis of cuBLAS, cuDNN and TensorRT

Zhumakhan Nazir, Vladislav Yarovenko, Jurn-Gyu Park. Interpretable ML enhanced CNN Performance Analysis of cuBLAS, cuDNN and TensorRT. In Jiman Hong, Maart Lanperne, Juw Won Park, Tomás Cerný, Hossain Shahriar, editors, Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, SAC 2023, Tallinn, Estonia, March 27-31, 2023. pages 1260-1265, ACM, 2023. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.