DeepFreeze: Towards Scalable Asynchronous Checkpointing of Deep Learning Models

Bogdan Nicolae, Jiali Li, Justin M. Wozniak, George Bosilca, Matthieu Dorier, Franck Cappello. DeepFreeze: Towards Scalable Asynchronous Checkpointing of Deep Learning Models. In 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGRID 2020, Melbourne, Australia, May 11-14, 2020. pages 172-181, IEEE, 2020.

Authors

Bogdan Nicolae

Jiali Li

Justin M. Wozniak

George Bosilca

Matthieu Dorier

Franck Cappello
