Abstract is missing.
- QUARC: An Array Programming Approach to High Performance ComputingDiptorup Deb, Robert J. Fowler, Allan Porterfield. 3-17 [doi]
- Utilizing Concurrency: A New Theory for Memory WallXian-He Sun, Yu-Hang Liu. 18-23 [doi]
- ParFuse: Parallel and Compositional Analysis of Message Passing ProgramsSriram Aananthakrishnan, Greg Bronevetsky, Mark Baranowski, Ganesh Gopalakrishnan. 24-39 [doi]
- Fast Approximate Distance Queries in Unweighted Graphs Using Bounded AsynchronyAdam Fidel, Francisco Coral-Sabido, Colton Riedel, Nancy M. Amato, Lawrence Rauchwerger. 40-54 [doi]
- Energy Avoiding Matrix MultiplyKelly Livingston, Aaron Landwehr, José Monsalve, Stéphane Zuckerman, Benoît Meister, Guang R. Gao. 55-70 [doi]
- Language Support for Reliable Memory RegionsSaurabh Hukerikar, Christian Engelmann. 73-87 [doi]
- Harnessing Parallelism in Multicore Systems to Expedite and Improve Function ApproximationAurangzeb, Rudolf Eigenmann. 88-92 [doi]
- Adaptive Software Caching for Efficient NVRAM Data PersistencePengcheng Li, Dhruva R. Chakrabarti. 93-97 [doi]
- Polyhedral Compiler Technology in Collaboration with Autotuning Important to Domain-Specific Frameworks for HPCMary W. Hall, Protonu Basu. 101-105 [doi]
- An Extended Polyhedral Model for SPMD Programs and Its Use in Static Data Race DetectionPrasanth Chatarasi, Jun Shirako, Martin Kong, Vivek Sarkar. 106-120 [doi]
- Polygonal Iteration Space PartitioningAniket Shivam, Alexandru Nicolau, Alexander V. Veidenbaum, Mario Mango Furnari, Rosario Cammarota. 121-136 [doi]
- Automatically Optimizing Stencil Computations on Many-Core NUMA ArchitecturesPei-Hung Lin, Qing Yi, Daniel J. Quinlan, Chunhua Liao, Yongqing Yan. 137-152 [doi]
- Formalizing Structured Control Flow GraphsAmit Sabne, Putt Sakdhnagool, Rudolf Eigenmann. 153-168 [doi]
- Automatic Vectorization for MATLABHanfeng Chen, Alexander Krolik, Erick Lavoie, Laurie J. Hendren. 171-187 [doi]
- Analyzing Parallel Programming Models for Magnetic Resonance ImagingForest Danford, Eric Welch, Julio Cárdenas-Ródriguez, Michelle Mills Strout. 188-202 [doi]
- The Importance of Efficient Fine-Grain Synchronization for Many-Core SystemsTongsheng Geng, Stéphane Zuckerman, José Monsalve, Alfredo Goldman, Sami Habib, Jean-Luc Gaudiot, Guang R. Gao. 203-217 [doi]
- Optimizing LOBPCG: Sparse Matrix Loop and Data Transformations in ActionKhalid Ahmad, Anand Venkat, Mary W. Hall. 218-232 [doi]
- LightHouse: An Automatic Code Generator for Graph Algorithms on GPUsG. Shashidhar, Rupesh Nasre. 235-249 [doi]
- Locality-Aware Task-Parallel Execution on GPUsJad Hbeika, Milind Kulkarni. 250-264 [doi]
- Automatic Copying of Pointer-Based Data StructuresTong Chen, Zehra Sura, Hyojin Sung. 265-281 [doi]
- Automatic Local Memory Management for Multicores Having Global Address SpaceKouhei Yamamoto, Tomoya Shirakawa, Yoshitake Oki, Akimasa Yoshida, Keiji Kimura, Hironori Kasahara. 282-296 [doi]
- Mapping Medley: Adaptive Parallelism Mapping with Varying Optimization GoalsMurali Krishna Emani. 299-313 [doi]
- The Contention Avoiding Concurrent Priority QueueKonstantinos F. Sagonas, Kjell Winblad. 314-330 [doi]
- Evaluating Performance of Task and Data Coarsening in Concurrent CollectionsChenyang Liu, Milind Kulkarni. 331-345 [doi]