Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation

Liu Ke, Udit Gupta, Mark Hempstead, Carole-Jean Wu, Hsien-Hsin S. Lee, Xuan Zhang 0001. Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation. In IEEE International Symposium on High-Performance Computer Architecture, HPCA 2022, Seoul, South Korea, April 2-6, 2022. pages 141-144, IEEE, 2022. [doi]

Abstract

Abstract is missing.