A Fault Tolerance Manager with Distributed Coordinated Checkpoints for Automatic Recovery

Jorge Villamayor, Dolores Rexachs, Emilio Luque. A Fault Tolerance Manager with Distributed Coordinated Checkpoints for Automatic Recovery. In 2017 International Conference on High Performance Computing & Simulation, HPCS 2017, Genoa, Italy, July 17-21, 2017. pages 452-459, IEEE, 2017. [doi]

Abstract

Abstract is missing.