Abstract is missing.
- Towards Ad Hoc Recovery for Soft ErrorsNuria Losada, Leonardo Bautista-Gomez, Kai Keller, Osman S. Unsal. 1-10 [doi]
- Fault Tolerant Cholesky Factorization on GPUsFelix Loh, Kewal K. Saluja, Parameswaran Ramanathan. 11-18 [doi]
- Improving Application Resilience by Extending Error Correction with Contextual InformationAlexandra Poulos, Dylan Wallace, Robert Robey, Laura Monroe, Vanessa Job, Sean Blanchard, William M. Jones, Nathan DeBardeleben. 19-28 [doi]
- A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN PlatformYawei Hui, Byung-Hoon Park, Christian Engelmann. 29-38 [doi]
- Analyzing the Impact of System Reliability Events on Applications in the Titan SupercomputerRizwan A. Ashraf, Christian Engelmann. 39-48 [doi]
- Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient MethodsCarlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer. 49-58 [doi]
- CPU Overheating Characterization in HPC Systems: A Case StudyMarc Platini, Thomas Ropars, Benoit Pelletier, Noel De Palma. 59-68 [doi]
- SaNSA - The Supercomputer and Node State ArchitectureNeil Agarwal, Hugh Greenberg, Sean Blanchard, Nathan DeBardeleben. 69-78 [doi]
- Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG SchemesAnne Reinarz, Jean-Mathieu Gallard, Michael Bader. 79-86 [doi]