Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective

Seokjin Go, Joongun Park, Spandan More, Hanjiang Wu, Irene Wang, Aaron Jezghani, Tushar Krishna, Divya Mahajan. Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective. In Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, MICRO 2025, Seoul, Republic of Korea, October 18-22, 2025. Pages 626-642, ACM, 2025.
