ConCCL: Optimizing ML Concurrent Computation and Communication with GPU DMA Engines

Anirudha Agrawal, Shaizeen Aga, Suchita Pati, Mahzabeen Islam. ConCCL: Optimizing ML Concurrent Computation and Communication with GPU DMA Engines. In IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2025, Ghent, Belgium, May 11-13, 2025. pages 1-11, IEEE, 2025. [doi]

Abstract

Abstract is missing.