DeepClone: Lightweight State Replication of Deep Learning Models for Data Parallel Training

Bogdan Nicolae, Justin M. Wozniak, Matthieu Dorier, Franck Cappello. DeepClone: Lightweight State Replication of Deep Learning Models for Data Parallel Training. In IEEE International Conference on Cluster Computing, CLUSTER 2020, Kobe, Japan, September 14-17, 2020, pages 226-236. IEEE, 2020.
