Abstract is missing.
- Optimistic Parallelism on GPUsMin Feng, Rajiv Gupta, Laxmi N. Bhuyan. 3-18 [doi]
- Directive-Based Compilers for GPUsSwapnil Ghike, Ruben Gran, María Jesús Garzarán, David A. Padua. 19-35 [doi]
- GLES: A Practical GPGPU Optimizing Compiler Using Data Sharing and Thread CoarseningZhen Lin, Xiaopeng Gao, Han Wan, Bo Jiang. 36-50 [doi]
- Evaluating Performance Portability of OpenACCAmit Sabne, Putt Sakdhnagool, Seyong Lee, Jeffrey S. Vetter. 51-66 [doi]
- NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming ModelRengan Xu, Xiaonan Tian, Sunita Chandrasekaran, Yonghong Yan 0001, Barbara M. Chapman. 67-81 [doi]
- Understanding Co-run Degradations on Integrated Heterogeneous ProcessorsQi Zhu, Bo Wu, Xipeng Shen, Li Shen, Zhiying Wang. 82-97 [doi]
- Simultaneous Inspection: Hiding the Overhead of Inspector-Executor Style Dynamic ParallelizationDaniel Brinkers, Ronald Veldema, Michael Philippsen. 101-115 [doi]
- Tiled Linear Algebra a System for Parallel Graph AlgorithmsSaeed Maleki, G. Carl Evans, David A. Padua. 116-130 [doi]
- An Approach for Proving the Correctness of Inspector/Executor TransformationsMichael Norrish, Michelle Mills Strout. 131-145 [doi]
- Fast Automatic Heuristic Construction Using Active LearningWilliam F. Ogilvie, Pavlos Petoumenos, Zheng Wang, Hugh Leather. 146-160 [doi]
- Jagged Tiling for Intra-tile Parallelism and Fine-Grain MultithreadingSunil Shrestha, Joseph Manzano, Andrès Márquez, John Feo, Guang R. Gao. 161-175 [doi]
- The stapl Skeleton FrameworkMani Zandifar, Nathan Thomas, Nancy M. Amato, Lawrence Rauchwerger. 176-190 [doi]
- Memory Management Techniques for Exploiting RDMA in PGAS LanguagesBarnaby Dalton, Gabriel Tanase, Michail Alvanos, Gheorghe Almási, Ettore Tiotto. 193-207 [doi]
- Change Detection Based Parallelism Mapping: Exploiting Offline Models and Online AdaptationMurali Krishna Emani, Michael F. P. O'Boyle. 208-223 [doi]
- Automatic Streamization of Image Processing ApplicationsPierre Guillou, Fabien Coelho, François Irigoin. 224-238 [doi]
- Evaluation of Automatic Power Reduction with OSCAR Compiler on Intel Haswell and ARM Cortex-A9 MulticoresTomohiro Hirano, Hideo Yamamoto, Shuhei Iizuka, Kohei Muto, Takashi Goto, Tamami Wake, Hiroki Mikami, Moriyuki Takamura, Keiji Kimura, Hironori Kasahara. 239-252 [doi]
- \pi Abstraction: Parallelism-Aware Array Data Flow Analysis for OpenMPFahed Jubair, Okwan Kwon, Rudolf Eigenmann, Samuel P. Midkiff. 253-267 [doi]
- Static Approximation of MPI Communication Graphs for Optimized Process PlacementAndrew J. McPherson, Vijay Nagarajan, Marcelo Cintra. 268-283 [doi]
- Automatic Parallelism Through Macro Dataflow in MATLABPushkar Ratnalikar, Arun Chauhan. 284-299 [doi]
- Re-Engineering Compiler Transformations to Outperform Database Query OptimizersKristian F. D. Rietveld, Harry A. G. Wijshoff. 300-314 [doi]
- Systematic Debugging of Concurrent Systems Using Coalesced Stack Trace GraphsDiego Caminha B. de Oliveira, Zvonimir Rakamaric, Ganesh Gopalakrishnan, Alan Humphrey, Qingyu Meng, Martin Berzins. 317-331 [doi]
- LightPlay: Efficient Replay with GPUsMin Feng, Farzad Khorasani, Rajiv Gupta, Laxmi N. Bhuyan. 332-347 [doi]
- Exploring and Evaluating Array Layout Restructuring for SIMDizationChristopher Haine, Olivier Aumage, Enguerrand Petit, Denis Barthou. 351-366 [doi]
- Unification of Static and Dynamic Analyses to Enable VectorizationAshay Rane, Rakesh Krishnaiyer, Chris J. Newburn, James Browne, Leonardo Fialho, Zakhar Matveev. 367-381 [doi]
- Efficient Exploitation of Hyper Loop Parallelism in VectorizationShixiong Xu, David Gregg. 382-396 [doi]