Reading between the lines of failure logs: Understanding how HPC systems fail

Nosayba El-Sayed, Bianca Schroeder. Reading between the lines of failure logs: Understanding how HPC systems fail. In 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Budapest, Hungary, June 24-27, 2013. pages 1-12, IEEE, 2013. [doi]

Abstract

Abstract is missing.