Wesley Bland, Peng Du, Aurelien Bouteiller, Thomas Hérault, George Bosilca, Jack Dongarra. A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI. In Christos Kaklamanis, Theodore S. Papatheodorou, Paul G. Spirakis, editors, Euro-Par 2012 Parallel Processing - 18th International Conference, Euro-Par 2012, Rhodes Island, Greece, August 27-31, 2012. Proceedings. Volume 7484 of Lecture Notes in Computer Science, pages 477-488, Springer, 2012.