Power Capping of GPU Servers for Machine Learning Inference Optimization

Yuan Ma, Srinivasan Subramaniyan, Xiaorui Wang. Power Capping of GPU Servers for Machine Learning Inference Optimization. In Proceedings of the 54th International Conference on Parallel Processing, ICPP 2025, San Diego, CA, USA, September 8-11, 2025. pages 449-459, ACM, 2025. [doi]

Abstract

Abstract is missing.