Exploring the Limits of Concurrency in ML Training on Google TPUs

Sameer Kumar, Yu Emma Wang, Cliff Young, James Bradbury, Naveen Kumar, Dehao Chen, Andy Swing. Exploring the Limits of Concurrency in ML Training on Google TPUs. In Alex Smola, Alex Dimakis, Ion Stoica, editors, Proceedings of Machine Learning and Systems 2021, MLSys 2021, virtual, April 5-9, 2021. mlsys.org, 2021.
