Pluggable Watchdog: Transparent Failure Detection for MPI Programs

Keun Soo Yim, Zbigniew Kalbarczyk, Ravishankar K. Iyer. Pluggable Watchdog: Transparent Failure Detection for MPI Programs. In 27th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2013, Cambridge, MA, USA, May 20-24, 2013. pages 489-500, IEEE Computer Society, 2013. [doi]

Abstract

Abstract is missing.