Lessons Learned Implementing User-Level Failure Mitigation in MPICH

Wesley Bland, Huiwei Lu, Sangmin Seo, Pavan Balaji. Lessons Learned Implementing User-Level Failure Mitigation in MPICH. In 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2015, Shenzhen, China, May 4-7, 2015. pages 1123-1126, IEEE, 2015. [doi]

@inproceedings{BlandLSB15,
  title = {Lessons Learned Implementing User-Level Failure Mitigation in MPICH},
  author = {Wesley Bland and Huiwei Lu and Sangmin Seo and Pavan Balaji},
  year = {2015},
  doi = {10.1109/CCGrid.2015.51},
  url = {http://dx.doi.org/10.1109/CCGrid.2015.51},
  researchr = {https://researchr.org/publication/BlandLSB15},
  cites = {0},
  citedby = {0},
  pages = {1123-1126},
  booktitle = {15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2015, Shenzhen, China, May 4-7, 2015},
  publisher = {IEEE},
  isbn = {978-1-4799-8006-2},
}