System-Level Scalable Checkpoint-Restart for Petascale Computing

Jiajun Cao, Kapil Arya, Rohan Garg, L. Shawn Matott, Dhabaleswar K. Panda, Hari Subramoni, Jérôme Vienne, Gene Cooperman. System-Level Scalable Checkpoint-Restart for Petascale Computing. In 22nd IEEE International Conference on Parallel and Distributed Systems, ICPADS 2016, Wuhan, China, December 13-16, 2016. pages 932-941, IEEE, 2016. [doi]

Abstract

Abstract is missing.