Multivariate LSTM for Execution Time Prediction in HPC for Distributed Deep Learning Training

Tasnim Assali, Zayneb Trabelsi Ayoub, Sofiane Ouni. Multivariate LSTM for Execution Time Prediction in HPC for Distributed Deep Learning Training. In 27th IEEE International Symposium on Real-Time Distributed Computing, ISORC 2024, Tunis, Tunisia, May 22-25, 2024. pages 1-5, IEEE, 2024. [doi]

Abstract

Abstract is missing.