- Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs. Edmond Chow, Hartwig Anzt, Jack Dongarra. 1-16
- Matrix Multiplication on High-Density Multi-GPU Architectures: Theoretical and Experimental Investigations. Peng Zhang, Yu-Xiang Gao. 17-30
- A Framework for Batched and GPU-Resident Factorization Algorithms Applied to Block Householder Transformations. Azzam Haidar, Tingxing Tim Dong, Stanimire Tomov, Piotr Luszczek, Jack Dongarra. 31-47
- Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms. Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, JongSoo Park, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Sergey G. Pudov, Vadim O. Pirogov, Pradeep Dubey. 48-57
- On the Design, Development, and Analysis of Optimized Matrix-Vector Multiplication Routines for Coprocessors. Khairul Kabir, Azzam Haidar, Stanimire Tomov, Jack Dongarra. 58-73
- Large-Scale Neo-Heterogeneous Programming and Optimization of SNP Detection on Tianhe-2. Yingbo Cui, Xiangke Liao, Shaoliang Peng, Yutong Lu, Canqun Yang, Bingqiang Wang, Chengkun Wu. 74-86
- ACCOLADES: A Scalable Workflow Framework for Large-Scale Simulation and Analyses of Automotive Engines. Shashi M. Aithal, Stefan M. Wild. 87-95
- Accelerating LBM and LQCD Application Kernels by In-Memory Processing. Paul F. Baumeister, Hans Boettiger, José R. Brunheroto, Thorsten Hater, Thilo Maurer, Andrea Nobile, Dirk Pleiter. 96-112
- On Quantum Chemistry Code Adaptation for RSC PetaStream Architecture. Vladimir Mironov, Maria Khrenova, Alexander Moskovsky. 113-121
- Dtree: Dynamic Task Scheduling at Petascale. Kiran Pamnany, Sanchit Misra, Md. Vasimuddin, Xing Liu, Edmond Chow, Srinivas Aluru. 122-138
- Feasibility Study of Porting a Particle Transport Code to FPGA. Iakovos Panourgias, Michèle Weiland, Mark Parsons, David Turland, Dave Barrett, Wayne P. Gaudin. 139-154
- A Scalable, Linear-Time Dynamic Cutoff Algorithm for Molecular Dynamics. Paul Springer, Ahmed E. Ismail, Paolo Bientinesi. 155-170
- BWTCP: A Parallel Method for Constructing BWT in Large Collection of Genomic Reads. Heng Wang, Shaoliang Peng, Yutong Lu, Chengkun Wu, Jiajun Wen, Jie Liu, Xiaoqian Zhu. 171-178
- Lattice-CSC: Optimizing and Building an Efficient Supercomputer for Lattice-QCD and to Achieve First Place in Green500. David Rohr, Matthias Bach, Gvozden Neskovic, Volker Lindenstruth, Christopher Pinke, Owe Philipsen. 179-196
- An Efficient Clique-Based Algorithm of Compute Nodes Allocation for In-memory Checkpoint System. Xiangke Liao, Canqun Yang, Zhe Quan, Tao Tang, Cheng Chen. 197-211
- A Scalable Algorithm for Radiative Heat Transfer Using Reverse Monte Carlo Ray Tracing. Alan Humphrey, Todd Harman, Martin Berzins, Phillip Smith. 212-230
- Optimizing Processes Mapping for Tasks with Non-uniform Data Exchange Run on Cluster with Different Interconnects. Victor Getmanskiy, Vladimir Chalyshev, Dmitriy Kryzhanovsky, Igor Lopatin, Evgeny Leksikov. 231-239
- Dynamically Adaptable I/O Semantics for High Performance Computing. Michael Kuhn. 240-256
- Predicting Performance of Non-contiguous I/O with Machine Learning. Julian M. Kunkel, Michaela Zimmer, Eugen Betke. 257-273
- A Best Practice Analysis of HDF5 and NetCDF-4 Using Lustre. Christopher Bartz, Konstantinos Chasapis, Michael Kuhn, Petra Nerge, Thomas Ludwig. 274-281
- Striping Layout Aware Data Aggregation for High Performance I/O on a Lustre File System. Yuichi Tsujita, Atsushi Hori, Yutaka Ishikawa. 282-290
- Hop: Elastic Consistency for Exascale Data Stores. Latchesar Ionkov, Michael Lang. 291-306
- Energy-Efficient Data Processing Through Data Sparsing with Artifacts. Pablo Graubner, Patrick Heckmann, Bernd Freisleben. 307-322
- Updating the Energy Model for Future Exascale Systems. Peter M. Kogge. 323-339
- High-Order ADER-DG Minimizes Energy- and Time-to-Solution of SeisSol. Alexander Breuer, Alexander Heinecke, Leonhard Rannabauer, Michael Bader. 340-357
- Modeling the Productivity of HPC Systems on a Computing Center Scale. Sandra Wienke, Hristo Iliev, Dieter an Mey, Matthias S. Müller. 358-375
- Taking Advantage of Node Power Variation in Homogenous HPC Systems to Save Energy. Torsten Wilde, Axel Auweter, Hayk Shoukourian, Arndt Bode. 376-393
- A Run-Time System for Power-Constrained HPC Applications. Aniruddha Marathe, Peter E. Bailey, David K. Lowenthal, Barry Rountree, Martin Schulz, Bronis R. de Supinski. 394-408
- A Machine Learning Approach for a Scalable, Energy-Efficient Utility-Based Cache Partitioning. Isa Ahmet Guney, Abdullah Yildiz, Ismail Ugur Bayindir, Kemal Cagri Serdaroglu, Utku Bayik, Gurhan Kucuk. 409-421
- A Case Study - Cost of Preemption for Urgent Computing on SuperMUC. Siew Hoon Leong, Dieter Kranzlmüller. 422-433
- Designing Non-blocking Personalized Collectives with Near Perfect Overlap for RDMA-Enabled Clusters. Hari Subramoni, Ammar Ahmad Awan, Khaled Hamidouche, Dmitry Pekurovsky, Akshay Venkatesh, Sourav Chakraborty, Karen Tomko, Dhabaleswar K. Panda. 434-453
- Design Methodology for Optimizing Optical Interconnection Networks in High Performance Systems. Sébastien Rumley, Madeleine Glick, Simon D. Hammond, Arun Rodrigues, Keren Bergman. 454-471
- Quantifying Communication in Graph Analytics. Andreea Anghel, Germán Rodríguez, Bogdan Prisacari, Cyriel Minkenberg, Gero Dittmann. 472-487
- Formal Metrics for Large-Scale Parallel Performance. Kenneth Moreland, Ron A. Oldfield. 488-496
- Hunting Down Load Imbalance: A Moving Target. Christoph Pospiech. 497-505
- Orchestrating Docker Containers in the HPC Environment. Joshua Higgins, Violeta Holmes, Colin C. Venters. 506-513
- Performance and Scaling of WRF on Three Different Parallel Supercomputers. Zaphiris Christidis. 514-528