Scalable, fault tolerant membership for MPI tasks on HPC systems

Jyothish Varma, Chao Wang, Frank Mueller, Christian Engelmann, Stephen L. Scott. Scalable, fault tolerant membership for MPI tasks on HPC systems. In Gregory K. Egan, Yoichi Muraoka, editors, Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006, Cairns, Queensland, Australia, June 28 - July 01, 2006. pages 219-228, ACM, 2006.

@inproceedings{VarmaWMES06,
  title = {Scalable, fault tolerant membership for {MPI} tasks on {HPC} systems},
  author = {Jyothish Varma and Chao Wang and Frank Mueller and Christian Engelmann and Stephen L. Scott},
  year = {2006},
  doi = {10.1145/1183401.1183433},
  url = {http://doi.acm.org/10.1145/1183401.1183433},
  researchr = {https://researchr.org/publication/VarmaWMES06},
  cites = {0},
  citedby = {0},
  pages = {219--228},
  booktitle = {Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006, Cairns, Queensland, Australia, June 28 - July 01, 2006},
  editor = {Gregory K. Egan and Yoichi Muraoka},
  publisher = {ACM},
  isbn = {1-59593-282-8},
}