John A. Stratton, Sam S. Stone, Wen-mei W. Hwu. MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs. In José Nelson Amaral, editor, Languages and Compilers for Parallel Computing, 21th International Workshop, LCPC 2008, Edmonton, Canada, July 31 - August 2, 2008, Revised Selected Papers. Volume 5335 of Lecture Notes in Computer Science, pages 16-30, Springer, 2008.