MMPI: A Scalable Fault Tolerance Mechanism for MPI Large Scale Parallel Computing

Zhiyuan Wang, Xuejun Yang, Yun Zhou. MMPI: A Scalable Fault Tolerance Mechanism for MPI Large Scale Parallel Computing. In 10th IEEE International Conference on Computer and Information Technology, CIT 2010, Bradford, West Yorkshire, UK, June 29-July 1, 2010. Pages 1251-1256, IEEE Computer Society, 2010.

Authors

Zhiyuan Wang

Xuejun Yang

Yun Zhou
