Performance Optimization of Machine Learning Inference under Latency and Server Power Constraints

Guoyu Chen, Xiaorui Wang. Performance Optimization of Machine Learning Inference under Latency and Server Power Constraints. In 42nd IEEE International Conference on Distributed Computing Systems, ICDCS 2022, Bologna, Italy, July 10-13, 2022. pages 325-335, IEEE, 2022. [doi]

Abstract

Abstract is missing.