- Streaming Task Parallelism. Albert Cohen. 1
- Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes. Javier Cabezas, Lluís Vilanova, Isaac Gelado, Thomas B. Jablin, Nacho Navarro, Wen-mei W. Hwu. 3-13
- A Stall-Aware Warp Scheduling for Dynamically Optimizing Thread-level Parallelism in GPGPUs. Yulong Yu, Weijun Xiao, Xubin He, He Guo, Yuxin Wang, Xin Chen. 15-24
- PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization. Mehmet E. Belviranli, Peng Deng, Laxmi N. Bhuyan, Rajiv Gupta, Qi Zhu. 25-35
- PaCMap: Topology Mapping of Unstructured Communication Patterns onto Non-contiguous Allocations. Ozan Tuncer, Vitus J. Leung, Ayse Kivilcim Coskun. 37-46
- Unique Worker model for OpenMP. Raghesh Aloor, V. Krishna Nandivada. 47-56
- Exploiting Process Imbalance to Improve MPI Collective Operations in Hierarchical Systems. Benjamin S. Parsons, Vijay S. Pai. 57-66
- Locality-Driven Dynamic GPU Cache Bypassing. Chao Li, Shuaiwen Leon Song, Hongwen Dai, Albert Sidelnik, Siva Kumar Sastry Hari, Huiyang Zhou. 67-77
- SemCache++: Semantics-Aware Caching for Efficient Multi-GPU Offloading. Nabeel AlSaber, Milind Kulkarni. 79-88
- DaCache: Memory Divergence-Aware GPU Cache Management. Bin Wang, Weikuan Yu, Xian-He Sun, Xinning Wang. 89-98
- Automatic Selection of Sparse Matrix Representation on GPUs. Naser Sedaghati, Te Mu, Louis-Noël Pouchet, Srinivasan Parthasarathy, P. Sadayappan. 99-108
- Fine-Grained Synchronizations and Dataflow Programming on GPUs. Ang Li, Gert-Jan van den Braak, Henk Corporaal, Akash Kumar. 109-118
- Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations. Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, Jeffrey S. Vetter. 119-130
- Datacenter Efficiency: What's Next? Ricardo Bianchini. 131
- Towards Lightweight and Swift Storage Resource Management in Big Data Cloud Era. Ruijin Zhou, Huixiang Chen, Tao Li. 133-142
- Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters. Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang, Youliang Yan. 143-153
- Active Access: A Mechanism for High-Performance Distributed Data-Centric Computations. Maciej Besta, Torsten Hoefler. 155-164
- Exascaling Your Library: Will Your Implementation Meet Your Expectations? Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf. 165-175
- MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures. Tobias Gysi, Tobias Grosser, Torsten Hoefler. 177-186
- FAST: A Fast Stencil Autotuning Framework Based On An Optimal-solution Space Model. Yulong Luo, Guangming Tan, Zeyao Mo, Ninghui Sun. 187-196
- Parameterized Diamond Tiling for Stencil Computations with Chapel parallel iterators. Ian J. Bertolacci, Catherine Olschanowsky, Ben Harshbarger, Bradford L. Chamberlain, David G. Wonnacott, Michelle Mills Strout. 197-206
- Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model. Holger Stengel, Jan Treibig, Georg Hager, Gerhard Wellein. 207-216
- GreenPar: Scheduling Parallel High Performance Applications in Green Datacenters. Md. E. Haque, Iñigo Goiri, Ricardo Bianchini, Thu D. Nguyen. 217-227
- Underprovisioning the Grid Power Infrastructure for Green Datacenters. Xu Zhou, Qiang Cao, Hong Jiang, Changsheng Xie. 229-240
- Building Fuel Powered Supercomputing Data Center at Low Cost. Yiqing Hua, Chao Li, Weichao Tang, Li Jiang, Xiaoyao Liang. 241-250
- History-Assisted Adaptive-Granularity Caches (HAAG$) for High Performance 3D DRAM Architectures. Ke Chen, Sheng Li, Jung Ho Ahn, Naveen Muralimanohar, Jishen Zhao, Cong Xu, Seongil O, Yuan Xie, Jay B. Brockman, Norman P. Jouppi. 251-261
- Real-Time In-Memory Checkpointing for Future Hybrid Memory Systems. Shen Gao, Bingsheng He, Jianliang Xu. 263-272
- Leveraging Silicon-Photonic NoC for Designing Scalable GPUs. Amir Kavyan Ziabari, José L. Abellán, Rafael Ubal, Chao Chen, Ajay Joshi, David R. Kaeli. 273-282
- Automatically Scalable Computation. Margo Seltzer. 283
- Mower: A New Design for Non-blocking Misprediction Recovery. Zhaoxiang Jin, Görkem Asilioglu, Soner Önder. 285-294
- zFENCE: Data-less Coherence for Efficient Fences. Shaizeen Aga, Abhayendra Singh, Satish Narayanasamy. 295-305
- PALMOS: A Transparent, Multi-tasking Acceleration Layer for Parallel Heterogeneous Systems. Christos Margiolas, Michael F. P. O'Boyle. 307-318
- A Nested Partitioning Algorithm for Adaptive Meshes on Heterogeneous Clusters. Hari Sundar, Omar Ghattas. 319-328
- Criticality-Aware Dynamic Task Scheduling for Heterogeneous Architectures. Kallia Chronaki, Alejandro Rico, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta, Mateo Valero. 329-338
- CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication. Weifeng Liu, Brian Vinter. 339-350
- Optimistic Delinearization of Parametrically Sized Arrays. Tobias Grosser, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop. 351-360
- DASX: Hardware Accelerator for Software Data Structures. Snehasish Kumar, Naveen Vedula, Arrvindh Shriraman, Vijayalakshmi Srinivasan. 361-372
- Automatic Energy Efficient Parallelization of Uniform Dependence Computations. Yun Zou, Sanjay V. Rajopadhye. 373-382
- ASPaS: A Framework for Automatic SIMDization of Parallel Sorting on x86-based Many-core Processors. Kaixi Hou, Hao Wang, Wu-chun Feng. 383-392
- Optimizing Overlapped Memory Accesses in User-directed Vectorization. Diego Caballero, Sara Royuela, Roger Ferrer, Alejandro Duran, Xavier Martorell. 393-404
- COMPASS: A Framework for Automated Performance Modeling and Prediction. Seyong Lee, Jeremy S. Meredith, Jeffrey S. Vetter. 405-414
- Composing Algorithmic Skeletons to Express High-Performance Scientific Applications. Mani Zandifar, Mustafa Abdul Jabbar, Alireza Majidi, David Keyes, Nancy M. Amato, Lawrence Rauchwerger. 415-424
- STAPL-RTS: An Application Driven Runtime System. Ioannis Papadopoulos, Nathan Thomas, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger. 425-434