Taming of the Shrew: Modeling the Normal and Faulty Behaviour of Large-scale HPC Systems

Ana Gainaru, Franck Cappello, William Kramer. Taming of the Shrew: Modeling the Normal and Faulty Behaviour of Large-scale HPC Systems. In 26th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2012, Shanghai, China, May 21-25, 2012. pages 1168-1179, IEEE Computer Society, 2012. [doi]

Abstract

Abstract is missing.