Exploiting Hierarchical Parallelism and Reusability in Tensor Kernel Processing on Heterogeneous HPC Systems

Yuedan Chen, Guoqing Xiao 0001, M. Tamer Özsu, Zhuo Tang, Albert Y. Zomaya, Kenli Li 0001. Exploiting Hierarchical Parallelism and Reusability in Tensor Kernel Processing on Heterogeneous HPC Systems. In 38th IEEE International Conference on Data Engineering, ICDE 2022, Kuala Lumpur, Malaysia, May 9-12, 2022. pages 2522-2535, IEEE, 2022. [doi]

Abstract

Abstract is missing.