DOJO: Super-Compute System Scaling for ML Training

Bill Chang, Rajiv Kurian, Doug Williams, Eric Quinnell. DOJO: Super-Compute System Scaling for ML Training. In 2022 IEEE Hot Chips 34 Symposium, HCS 2022, Cupertino, CA, USA, August 21-23, 2022. pages 1-45, IEEE, 2022. [doi]

Abstract

Abstract is missing.