Scalable, fault tolerant membership for MPI tasks on HPC systems

Jyothish Varma, Chao Wang, Frank Mueller, Christian Engelmann, Stephen L. Scott. Scalable, fault tolerant membership for MPI tasks on HPC systems. In Gregory K. Egan, Yoichi Muraoka, editors, Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006, Cairns, Queensland, Australia, June 28 - July 01, 2006. Pages 219-228, ACM, 2006.
