CUDA-For-Clusters: A System for Efficient Execution of CUDA Kernels on Multi-core Clusters

Raghu Prabhakar, R. Govindarajan, Matthew J. Thazhuthaveetil. CUDA-For-Clusters: A System for Efficient Execution of CUDA Kernels on Multi-core Clusters. In Christos Kaklamanis, Theodore S. Papatheodorou, Paul G. Spirakis, editors, Euro-Par 2012 Parallel Processing - 18th International Conference, Euro-Par 2012, Rhodes Island, Greece, August 27-31, 2012. Proceedings. Volume 7484 of Lecture Notes in Computer Science, pages 415-426, Springer, 2012. [doi]

Abstract

Abstract is missing.