On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications

Thomas Ropars, Amina Guermouche, Bora Uçar, Esteban Meneses, Laxmikant V. Kalé, Franck Cappello. On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications. In Emmanuel Jeannot, Raymond Namyst, Jean Roman, editors, Euro-Par 2011 Parallel Processing - 17th International Conference, Euro-Par 2011, Bordeaux, France, August 29 - September 2, 2011, Proceedings, Part I. Volume 6852 of Lecture Notes in Computer Science, pages 567-578, Springer, 2011. [doi]

Authors

Thomas Ropars

This author has not been identified. Look up 'Thomas Ropars' in Google

Amina Guermouche

This author has not been identified. Look up 'Amina Guermouche' in Google

Bora Uçar

This author has not been identified. Look up 'Bora Uçar' in Google

Esteban Meneses

This author has not been identified. Look up 'Esteban Meneses' in Google

Laxmikant V. Kalé

This author has not been identified. Look up 'Laxmikant V. Kalé' in Google

Franck Cappello

This author has not been identified. Look up 'Franck Cappello' in Google