Anticipatory Resource Allocation for ML Training

Tapan Chugh, Srikanth Kandula, Arvind Krishnamurthy, Ratul Mahajan, Ishai Menache. Anticipatory Resource Allocation for ML Training. In Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, Santa Cruz, CA, USA, 30 October 2023 - 1 November 2023. Pages 410-426, ACM, 2023.

Abstract

Abstract is missing.