The following publications are possibly variants of this publication:
- Designing a ROCm-Aware MPI Library for AMD GPUs: Early Experiences. Kawthar Shafie Khorassani, Jahanzeb Maqbool Hashmi, Ching-Hsiang Chu, Chen-Chun Chen, Hari Subramoni, Dhabaleswar K. Panda 0001. supercomputer 2021: 118-136 [doi]
- Design and Implementation of an IPC-based Collective MPI Library for Intel GPUs. Chen-Chun Chen, Goutham Kalikrishna Reddy Kuncham, Pouya Kousha, Hari Subramoni, Dhabaleswar K. Panda 0001. xsede 2024: [doi]
- Performance Evaluation of MPI Libraries on GPU-Enabled OpenPOWER Architectures: Early Experiences. Kawthar Shafie Khorassani, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda. supercomputer 2019: 361-378 [doi]
- Design and Implementation of Kernel-based MPI Reduction Operations for Intel GPUs. Chen-Chun Chen, Goutham Kalikrishna Reddy Kuncham, Hari Subramoni, Dhabaleswar K. Panda 0001. hipc 2024: 122-131 [doi]
- Network Assisted Non-Contiguous Transfers for GPU-Aware MPI Libraries. Kaushik Kandadi Suresh, Kawthar Shafie Khorassani, Chen-Chun Chen, Bharath Ramesh 0005, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda 0001. hoti 2022: 13-20 [doi]
- Network-Assisted Noncontiguous Transfers for GPU-Aware MPI Libraries. Kaushik Kandadi Suresh, Kawthar Shafie Khorassani, Chen-Chun Chen, Bharath Ramesh 0005, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda 0001. micro, 43(2):131-139, March-April 2023. [doi]
- A Performance Analysis of GPU-Aware MPI Implementations Over the Slingshot-11 Interconnect. Michael Beebe, Rahulkumar Gayatri, Kevin Gott, Adam Lavely, Muhammad Haseeb, Brandon Cook 0001, Yong Chen 0001. hpec 2024: 1-7 [doi]