Power Capping of GPU Servers for Machine Learning Inference Optimization

Yuan Ma, Srinivasan Subramaniyan, Xiaorui Wang. Power Capping of GPU Servers for Machine Learning Inference Optimization. In Proceedings of the 54th International Conference on Parallel Processing, ICPP 2025, San Diego, CA, USA, September 8-11, 2025. pages 449-459, ACM, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.