- Guided profiling for auto-tuning array layouts on GPUs. Nicolas Weber, Sandra C. Amend, Michael Goesele.
- Performance evaluation of the IBM POWER8 architecture to support computational neuroscientific application using morphologically detailed neurons. Timothée Ewart, Stuart Yates, Francesco Cremonesi, Pramod S. Kumbhar, Felix Schürmann, Fabien Delalondre.
- Performance analysis of OpenMP on a GPU using a CORAL proxy application. Gheorghe-Teodor Bercea, Carlo Bertolli, Samuel F. Antão, Arpith C. Jacob, Alexandre E. Eichenberger, Tong Chen, Zehra Sura, Hyojin Sung, Georgios Rokos, David Appelhans, Kevin O'Brien.
- Characterizing node orderings for improved performance. Carl Albing.
- Simulating stencil-based application on future Xeon Phi processor. Chitra Natarajan, Carl J. Beckmann, Anthony Nguyen, Mauricio Araya-Polo, Tryggve Fossum, Detlef Hohl.
- ARMv8 micro-architectural design space exploration for high performance computing using fractional factorial. Roxana Rusitoru.
- Examining recent many-core architectures and programming models using SHOC. M. Graham Lopez, Jeffrey Young, Jeremy S. Meredith, Philip C. Roth, Mitchel D. Horton, Jeffrey S. Vetter.
- Automatic loop kernel analysis and performance modeling with Kerncraft. Julian Hammer, Georg Hager, Jan Eitzinger, Gerhard Wellein.
- Techniques for modeling large-scale HPC I/O workloads. Shane Snyder, Philip H. Carns, Robert Latham, Misbah Mubarak, Robert B. Ross, Christopher D. Carothers, Babak Behzad, Huong Vu Thanh Luu, Surendra Byna, Prabhat.