TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML

Thorsten Kurth, Mikhail Smorkalov, Peter Mendygral, Srinivas Sridharan 0002, Amrita Mathuriya. TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML. Concurrency - Practice and Experience, 31(16), 2019. [doi]

Abstract

Abstract is missing.