CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters

Ching-Hsiang Chu, Khaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, Dhabaleswar K. Panda. CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters. In IEEE/ACM 16th International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2016, Cartagena, Colombia, May 16-19, 2016, pages 726-735. IEEE Computer Society, 2016.
