Abstract is missing.
- Energy efficiency and performance frontiers for sparse computations on GPU supercomputersHartwig Anzt, Stanimire Tomov, Jack J. Dongarra. 1-10 [doi]
- Energy-efficient computing for HPC workloads on heterogeneous manycore chipsAkhil Langer, Ehsan Totoni, Udatta S. Palekar, Laxmikant V. Kalé. 11-19 [doi]
- A performance study of Java garbage collectors on multicore architecturesMaria Carpen Amarie, Patrick Marlier, Pascal Felber, Gaël Thomas. 20-29 [doi]
- Toward an evolutionary task parallel integrated MPI + X programming modelRichard F. Barrett, Dylan T. Stark, Courtenay T. Vaughan, Ryan E. Grant, Stephen L. Olivier, Kevin T. Pedretti. 30-39 [doi]
- Design and evaluation of a novel dataflow based bigdata solutionYao Wu, Long Zheng, Brian Heilig, Guang R. Gao. 40-48 [doi]
- Programming support for reconfigurable custom vector architecturesMehmet Ali Arslan, Krzysztof Kuchcinski, Flavius Gruian, Yangxurui Liu. 49-57 [doi]
- Thread-level parallelization and optimization of NWChem for the Intel MIC architectureHongzhang Shan, Samuel Williams, Wibe De Jong, Leonid Oliker. 58-67 [doi]
- Parallelism vs. speculation: exploiting speculative genetic algorithm on GPUYanchao Lu, Long Zheng, Li Li, Minyi Guo. 68-74 [doi]
- GPU technology applied to reverse time migration and seismic modeling via OpenACCAhmad Qawasmeh, Barbara M. Chapman, Maxime R. Hugues, Henri Calandra. 75-85 [doi]
- Parallelizing a discrete event simulation application using the Habanero-Java multicore libraryWei-Cheng Xiao, Jisheng Zhao, Vivek Sarkar. 86-95 [doi]
- RaftLib: a C++ template library for high performance stream parallel processingJonathan C. Beard, Peng Li, Roger D. Chamberlain. 96-105 [doi]
- A Java util concurrent park contention toolPanagiotis Patros, Eric Aubanel, David Bremner, Michael Dawson. 106-111 [doi]
- Debugging parallel programs using fork handlersJavier Alcázar Zapién. 112-121 [doi]
- Effective communication for a system of cluster-on-a-chip processorsPablo Reble, Stefan Lankes, Fabian Fischer, Matthias S. Müller. 122-131 [doi]
- Exploiting communication concurrency on high performance computing systemsNicholas Chaimov, Khaled Z. Ibrahim, Samuel Williams, Costin Iancu. 132-143 [doi]
- CRA: a dynamic task allocation algorithm for many-core processorChang Wang, Jiang Jiang, Yongxing Zhu, Xu Liu, Xing Han. 144-152 [doi]
- Patty: a pattern-based parallelization tool for the multicore ageKorbinian Molitorisz, Tobias Müller, Walter F. Tichy. 153-163 [doi]
- Deadlock-free buffer configuration for stream computingPeng Li, Jonathan C. Beard, Jeremy Buhler. 164-169 [doi]
- Supporting multiple accelerators in high-level programming modelsYonghong Yan 0001, Pei-Hung Lin, Chunhua Liao, Bronis R. de Supinski, Daniel J. Quinlan. 170-180 [doi]