CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters

Ching-Hsiang Chu, Khaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, Dhabaleswar K. Panda. CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters. In IEEE/ACM 16th International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2016, Cartagena, Colombia, May 16-19, 2016. pages 726-735, IEEE Computer Society, 2016. [doi]

@inproceedings{ChuHVAP16,
  title = {CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters},
  author = {Ching-Hsiang Chu and Khaled Hamidouche and Akshay Venkatesh and Ammar Ahmad Awan and Dhabaleswar K. Panda},
  year = {2016},
  doi = {10.1109/CCGrid.2016.111},
  url = {http://doi.ieeecomputersociety.org/10.1109/CCGrid.2016.111},
  researchr = {https://researchr.org/publication/ChuHVAP16},
  cites = {0},
  citedby = {0},
  pages = {726-735},
  booktitle = {IEEE/ACM 16th International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2016, Cartagena, Colombia, May 16-19, 2016},
  publisher = {IEEE Computer Society},
  isbn = {978-1-5090-2453-7},
}