Abstract is missing.
- A Non-Invasive Approach for Realizing Resilience in MPIUmar Kalim, Mark K. Gardner, Wu Feng. 1-8 [doi]
- Optimal Checkpointing Period with Replicated Execution on Heterogeneous PlatformsAnne Benoit, Aurélien Cavelan, Valentin Le Fèvre, Yves Robert. 9-16 [doi]
- Understanding the Spatial Characteristics of DRAM Errors in HPC ClustersAyush Patwari, Ignacio Laguna, Martin Schulz 0001, Saurabh Bagchi. 17-22 [doi]
- Towards New Metrics for High-Performance Computing ResilienceSaurabh Hukerikar, Rizwan A. Ashraf, Christian Engelmann. 23-30 [doi]
- Identifying the Right Replication Level to Detect and Correct Silent Errors at ScaleAnne Benoit, Aurélien Cavelan, Franck Cappello, Padma Raghavan, Yves Robert, Hongynag Sun. 31-38 [doi]