Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

Zhaoxia Deng, JongSoo Park, Ping Tak Peter Tang, Haixin Liu, Jie Yang, Hector Yuen, Jianyu Huang, Daya Shanker Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Nadathur Satish, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy. Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale. IEEE Micro, 41(5):93-100, 2021. [doi]

@article{DengPTLYYHKWWCK21,
  title = {Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale},
  author = {Zhaoxia Deng and JongSoo Park and Ping Tak Peter Tang and Haixin Liu and Jie Yang and Hector Yuen and Jianyu Huang and Daya Shanker Khudia and Xiaohan Wei and Ellie Wen and Dhruv Choudhary and Raghuraman Krishnamoorthi and Carole-Jean Wu and Nadathur Satish and Changkyu Kim and Maxim Naumov and Sam Naghshineh and Mikhail Smelyanskiy},
  year = {2021},
  doi = {10.1109/MM.2021.3081981},
  url = {https://doi.org/10.1109/MM.2021.3081981},
  researchr = {https://researchr.org/publication/DengPTLYYHKWWCK21},
  cites = {0},
  citedby = {0},
  journal = {IEEE Micro},
  volume = {41},
  number = {5},
  pages = {93-100},
}