Supporting Massive DLRM Inference through Software Defined Memory

Ehsan K. Ardestani, Changkyu Kim, Seung Jae Lee, Luoshang Pan, Jens Axboe, Valmiki Rampersad, Banit Agrawal, Fuxun Yu, Ansha Yu, Trung Le, Hector Yuen, Dheevatsa Mudigere, Shishir Juluri, Akshat Nanda, Manoj Wodekar, Krishnakumar Nair, Maxim Naumov, Chris Petersen, Mikhail Smelyanskiy, Vijay Rao. Supporting Massive DLRM Inference through Software Defined Memory. In 42nd IEEE International Conference on Distributed Computing Systems, ICDCS 2022, Bologna, Italy, July 10-13, 2022, pages 302-312. IEEE, 2022. doi:10.1109/ICDCS54860.2022.00037

@inproceedings{ArdestaniKLPARA22,
  title = {Supporting Massive DLRM Inference through Software Defined Memory},
  author = {Ehsan K. Ardestani and Changkyu Kim and Seung Jae Lee and Luoshang Pan and Jens Axboe and Valmiki Rampersad and Banit Agrawal and Fuxun Yu and Ansha Yu and Trung Le and Hector Yuen and Dheevatsa Mudigere and Shishir Juluri and Akshat Nanda and Manoj Wodekar and Krishnakumar Nair and Maxim Naumov and Chris Petersen and Mikhail Smelyanskiy and Vijay Rao},
  year = {2022},
  doi = {10.1109/ICDCS54860.2022.00037},
  url = {https://doi.org/10.1109/ICDCS54860.2022.00037},
  researchr = {https://researchr.org/publication/ArdestaniKLPARA22},
  cites = {0},
  citedby = {0},
  pages = {302--312},
  booktitle = {42nd IEEE International Conference on Distributed Computing Systems, ICDCS 2022, Bologna, Italy, July 10-13, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-7177-0},
}