On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications

Thomas Ropars, Amina Guermouche, Bora Uçar, Esteban Meneses, Laxmikant V. Kalé, Franck Cappello. On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications. In Emmanuel Jeannot, Raymond Namyst, Jean Roman, editors, Euro-Par 2011 Parallel Processing - 17th International Conference, Euro-Par 2011, Bordeaux, France, August 29 - September 2, 2011, Proceedings, Part I. Volume 6852 of Lecture Notes in Computer Science, pages 567-578, Springer, 2011. doi:10.1007/978-3-642-23400-2_53

@inproceedings{RoparsGUMKC11,
  title = {On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications},
  author = {Thomas Ropars and Amina Guermouche and Bora Uçar and Esteban Meneses and Laxmikant V. Kalé and Franck Cappello},
  year = {2011},
  doi = {10.1007/978-3-642-23400-2_53},
  url = {http://dx.doi.org/10.1007/978-3-642-23400-2_53},
  tags = {rule-based},
  researchr = {https://researchr.org/publication/RoparsGUMKC11},
  cites = {0},
  citedby = {0},
  pages = {567--578},
  booktitle = {Euro-Par 2011 Parallel Processing - 17th International Conference, Euro-Par 2011, Bordeaux, France, August 29 - September 2, 2011, Proceedings, Part I},
  editor = {Emmanuel Jeannot and Raymond Namyst and Jean Roman},
  volume = {6852},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-642-23399-9},
}