Yuan Ma, Srinivasan Subramaniyan, Xiaorui Wang. Power Capping of GPU Servers for Machine Learning Inference Optimization. In Proceedings of the 54th International Conference on Parallel Processing, ICPP 2025, San Diego, CA, USA, September 8-11, 2025. pages 449-459, ACM, 2025. [doi]
No references recorded for this publication.
No citations of this publication recorded.