- 591 TFLOPS Multi-trillion Particles Simulation on SuperMUC. Wolfgang Eckhardt, Alexander Heinecke, Reinhold Bader, Matthias Brehm, Nicolay Hammer, Herbert Huber, Hans-Georg Kleinhenz, Jadran Vrabec, Hans Hasse, Martin Horsch, Martin Bernreuther, Colin W. Glass, Christoph Niethammer, Arndt Bode, Hans-Joachim Bungartz. 1-12
- Up to 700k GPU Cores, Kepler, and the Exascale Future for Simulations of Star Clusters Around Black Holes. Peter Berczik, Rainer Spurzem, Shiyan Zhong, Long Wang, Keigo Nitadori, Tsuyoshi Hamada, Alexander Veles. 13-25
- Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer. Chuanfu Xu, Xiaogang Deng, Lilun Zhang, Yi Jiang, Wei Cao, Jianbin Fang, Yonggang Che, Yongxian Wang, Wei Liu. 26-39
- Lattice QCD on Intel® Xeon Phi™ Coprocessors. Bálint Joó, Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Kiran Pamnany, Victor W. Lee, Pradeep Dubey, William A. Watson III. 40-54
- Towards Addressing CPU-Intensive Seismological Applications in Europe. Michele Carpenè, Iraklis A. Klampanos, Siew Hoon Leong, Emanuele Casarotti, Peter Danecek, Graziella Ferini, André Gemünd, Amrey Krause, Lion Krischer, Federica Magnoni, Marek Simon, Alessandro Spinuso, Luca Trani, Malcolm P. Atkinson, Giovanni Erbacci, Anton Frank, Heiner Igel, Andreas Rietbrock, Horst Schwichtenberg, Jean-Pierre Vilotte. 55-66
- Leading Edge Hybrid Multi-GPU Algorithms for Generalized Eigenproblems in Electronic Structure Calculations. Azzam Haidar, Raffaele Solcà, Mark Gates, Stanimire Tomov, Thomas C. Schulthess, Jack Dongarra. 67-80
- Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code and Large-Scale Performance Test on TH-1A. Xiangfei Meng, Xiaoqian Zhu, Peng Wang, Yang Zhao, Xin Liu, Bao Zhang, Yong Xiao, Wenlu Zhang, Zhihong Lin. 81-96
- Achieving Efficient Strong Scaling with PETSc Using Hybrid MPI/OpenMP Optimisation. Michael Lange, Gerard Gorman, Michèle Weiland, Lawrence Mitchell, James Southern. 97-108
- Designing Scalable Graph500 Benchmark with Hybrid MPI+OpenSHMEM Programming Models. Jithin Jose, Sreeram Potluri, Karen Tomko, Dhabaleswar K. Panda. 109-124
- On the GPU Performance of 3D Stencil Computations Implemented in OpenCL. Huayou Su, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai. 125-135
- Improving Performance Portability in OpenCL Programs. Yao Zhang, Mark Sinclair II, Andrew A. Chien. 136-150
- Auto-tuning of Sparse Matrix-Vector Multiplication on Graphics Processors. Walid A. Abu-Sufah, Asma Abdel Karim. 151-164
- A Simple Concept for the Performance Analysis of Cluster-Computing. Heinz Kredel, Sabine Richling, Jan Philipp Kruse, Erich Strohmaier, Hans-Günther Kruse. 165-180
- Using Simulation to Validate Performance of MPI(-IO) Implementations. Julian Martin Kunkel. 181-195
- Software Design Space Exploration for Exascale Combustion Co-design. Cy Chan, Didem Unat, Michael Lijewski, Weiqun Zhang, John Bell, John Shalf. 196-212
- Beyond the CPU: Hardware Performance Counter Monitoring on Blue Gene/Q. Heike McCraw, Daniel Terpstra, Jack Dongarra, Kris Davis, Roy G. Musselman. 213-225
- Maximizing Application Performance in a Multi-core, NUMA-Aware Compute Cluster by Multi-level Tuning. Gilad Shainer, Pak Lui, Martin Hilgeman, Jeffrey Layton, Cydney Stevens, Walker Stemple, Scot Schultz, Guy Ludden, Joshua Mora, Georg Kresse. 226-238
- Offload Compiler Runtime for the Intel Xeon Phi™ Coprocessor. Chris J. Newburn, Rajiv Deodhar, Serguei Dmitriev, Ravi Murty, Ravi Narayanaswamy, John Wiegert, Francisco Chinchilla, Russell McGuire. 239-254
- Fork-Join and Data-Driven Execution Models on Multi-core Architectures: Case Study of the FMM. Abdelhalim Amer, Naoya Maruyama, Miquel Pericàs, Kenjiro Taura, Rio Yokota, Satoshi Matsuoka. 255-266
- VLI - A Library for High Precision Integer and Polynomial Arithmetic. Timothée Ewart, Andreas Hehn, Matthias Troyer. 267-278
- Performance-Portable Finite Element Assembly Using PyOP2 and FEniCS. Graham R. Markall, Florian Rathgeber, Lawrence Mitchell, Nicolas Loriant, Carlo Bertolli, David A. Ham, Paul H. J. Kelly. 279-289
- Container-Based Job Management for Fair Resource Sharing. Jue Hong, Pavan Balaji, Gaojin Wen, Bibo Tu, Junming Yan, Cheng-Zhong Xu, Shengzhong Feng. 290-301
- One Size Does Not Fit All: Clustering Supercomputer Failures Using a Multiple Time Window Approach. Catello Di Martino. 302-316
- Tracking the Performance Evolution of Blue Gene Systems. Darren J. Kerbyson, Kevin J. Barker, Diego S. Gallo, Dong Chen, José R. Brunheroto, Kyung Dong Ryu, George L.-T. Chiu, Adolfy Hoisie. 317-329
- Accelerators for Technical Computing: Is It Worth the Pain? A TCO Perspective. Sandra Wienke, Dieter an Mey, Matthias S. Müller. 330-342
- Evaluating Lossy Compression on Climate Data. Nathanael Hübbe, Al Wegener, Julian Martin Kunkel, Yi Ling, Thomas Ludwig. 343-356
- The Effect of Topology-Aware Process and Thread Placement on Performance and Energy. Albert Solernou, Jeyarajan Thiyagalingam, Mihai C. Duta, Anne E. Trefethen. 357-371
- TUE, a New Energy-Efficiency Metric Applied at ORNL's Jaguar. Michael K. Patterson, Stephen W. Poole, Chung-Hsing Hsu, Don Maxwell, William Tschudi, Henry Coles, David J. Martinez, Natalie Bates. 372-382
- iDataCool: HPC with Hot-Water Cooling and Energy Reuse. Nils Meyer, Manfred Ries, Stefan Solbrig, Tilo Wettig. 383-394
- Pre-execution Data Prefetching with Inter-thread I/O Scheduling. Yue Zhao, Kenji Yoshigoe, Mengjun Xie. 395-407
- A Semantics-Aware I/O Interface for High Performance Computing. Michael Kuhn. 408-421
- Towards Self-optimization in HPC I/O. Michaela Zimmer, Julian Martin Kunkel, Thomas Ludwig. 422-434
- Using GPFS to Manage NVRAM-Based Storage Cache. Salem El Sayed, Stephan Graf, Michael Hennecke, Dirk Pleiter, Georg Schwarz, Heiko Schick, Michael Stephan. 435-446
- VM-MAD: A Cloud/Cluster Software for Service-Oriented Academic Environments. Tyanko Aleksiev, Simon Barkow-Oesterreicher, Peter Z. Kunszt, Sergio Maffioletti, Riccardo Murri, Christian Panse. 447-461
- Federating HPC Access via SAML: Towards a Plug-and-Play Solution. Jens Köhler, Michael Simon, Martin Nussbaumer, Hannes Hartenstein. 462-473