Sibylla: To Retry or Not To Retry on Deep Learning Job Failure

Taeyoon Kim, Suyeon Jeong, Jongseop Lee, Soobee Lee, Myeongjae Jeon. Sibylla: To Retry or Not To Retry on Deep Learning Job Failure. In Jiri Schindler, Noa Zilberman, editors, 2022 USENIX Annual Technical Conference, USENIX ATC 2022, Carlsbad, CA, USA, July 11-13, 2022. pages 263-270, USENIX Association, 2022. [doi]

Abstract

Abstract is missing.