CUDA-For-Clusters: A System for Efficient Execution of CUDA Kernels on Multi-core Clusters

Raghu Prabhakar, R. Govindarajan, Matthew J. Thazhuthaveetil. CUDA-For-Clusters: A System for Efficient Execution of CUDA Kernels on Multi-core Clusters. In Christos Kaklamanis, Theodore S. Papatheodorou, Paul G. Spirakis, editors, Euro-Par 2012 Parallel Processing - 18th International Conference, Euro-Par 2012, Rhodes Island, Greece, August 27-31, 2012. Proceedings. Volume 7484 of Lecture Notes in Computer Science, pages 415-426, Springer, 2012. [doi]

Authors

Raghu Prabhakar

This author has not been identified. Look up 'Raghu Prabhakar' in Google

R. Govindarajan

This author has not been identified. Look up 'R. Govindarajan' in Google

Matthew J. Thazhuthaveetil

This author has not been identified. Look up 'Matthew J. Thazhuthaveetil' in Google