Abstract is missing.
- Introducing 'Bones': a parallelizing source-to-source compiler based on algorithmic skeletonsCedric Nugteren, Henk Corporaal. 1-10 [doi]
- A distributed data-parallel framework for analysis and visualization algorithm developmentJeremy S. Meredith, Robert Sisneros, David Pugmire, Sean Ahern. 11-19 [doi]
- FLAT: a GPU programming framework to provide embedded MPITakefumi Miyoshi, Hidetsugu Irie, Keigo Shima, Hiroki Honda, Masaaki Kondo, Tsutomu Yoshinaga. 20-29 [doi]
- A GPU-based high-throughput image retrieval algorithmFeiwen Zhu, Peng Chen, Donglei Yang, Weihua Zhang, Haibo Chen, Binyu Zang. 30-37 [doi]
- Dynamic particle system for mesh extraction on the GPUMark Kim, Guoning Chen, Charles D. Hansen. 38-46 [doi]
- High-performance sparse matrix-vector multiplication on GPUs for structured grid computationsJeswin Godwin, Justin Holewinski, P. Sadayappan. 47-56 [doi]
- High performance 3-D FFT using multiple CUDA GPUsAkira Nukada, Yutaka Maruyama, Satoshi Matsuoka. 57-63 [doi]
- Paragon: collaborative speculative loop execution on GPU and CPUMehrzad Samadi, Amir Hormati, Janghaeng Lee, Scott A. Mahlke. 64-73 [doi]
- JaBEE: framework for object-oriented Java bytecode compilation and execution on graphics processor unitsWojciech Zaremba, Yuan Lin, Vinod Grover. 74-83 [doi]
- Enabling task-level scheduling on heterogeneous platformsEnqiang Sun, Dana Schaa, Richard Bagley, Norman Rubin, David R. Kaeli. 84-93 [doi]
- Auto-tuning interactive ray tracing using an analytical GPU architecture modelPer Ganestam, Michael C. Doggett. 94-100 [doi]
- Full system simulation of many-core heterogeneous SoCs using GPU and QEMU semihostingShivani Raghav, Andrea Marongiu, Christian Pinto, David Atienza, Martino Ruggiero, Luca Benini. 101-109 [doi]
- Reducing off-chip memory traffic by selective cache management scheme in GPGPUsHyojin Choi, Jae-Woo Ahn, Wonyong Sung. 110-119 [doi]