Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

Zhaoxia Deng, JongSoo Park, Ping Tak Peter Tang, Haixin Liu, Jie Yang, Hector Yuen, Jianyu Huang, Daya Shanker Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Nadathur Satish, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy. Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale. IEEE Micro, 41(5):93-100, 2021. [doi]