iCheck: Leveraging RDMA and Malleability for Application-Level Checkpointing in HPC Systems

Jophin John, Isaac David Núñez Araya, Michael Gerndt. iCheck: Leveraging RDMA and Malleability for Application-Level Checkpointing in HPC Systems. In 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022, Nanjing, China, January 10-12, 2023. pages 467-474, IEEE, 2022. [doi]

Abstract

Abstract is missing.