Fault Tolerance in Large-Scale Scientific Computing

Patricia D. Hough, Victoria E. Howle. Fault Tolerance in Large-Scale Scientific Computing. In Michael A. Heroux, Padma Raghavan, Horst D. Simon, editors, Parallel Processing for Scientific Computing. Volume 20 of Software, Environments, Tools, pages 203-220, SIAM, 2006. [doi]

Abstract

Abstract is missing.