CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism

Henk Dreuning, Kees Verstoep, Henri E. Bal, Rob V. van Nieuwpoort. CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism. In 30th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023, Goa, India, December 18-21, 2023. pages 76-86, IEEE, 2023. [doi]

Abstract

Abstract is missing.