System-Level Fault-Tolerance in Large-Scale Parallel Machines with Buffered Coscheduling

Fabrizio Petrini, Kei Davis, José Carlos Sancho. System-Level Fault-Tolerance in Large-Scale Parallel Machines with Buffered Coscheduling. In 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), CD-ROM / Abstracts Proceedings, 26-30 April 2004, Santa Fe, New Mexico, USA. IEEE Computer Society, 2004. [doi]

Abstract

Abstract is missing.