Communication-Efficient Model Parallelism for Distributed In-Situ Transformer Inference

Yuanxin Wei, Shengyuan Ye, Jiazhi Jiang, Xu Chen, Dan Huang, Jiangsu Du, Yutong Lu. Communication-Efficient Model Parallelism for Distributed In-Situ Transformer Inference. In Design, Automation & Test in Europe Conference & Exhibition, DATE 2024, Valencia, Spain, March 25-27, 2024. pages 1-6, IEEE, 2024. [doi]

@inproceedings{WeiYJCHDL24,
  title = {Communication-Efficient Model Parallelism for Distributed In-Situ Transformer Inference},
  author = {Yuanxin Wei and Shengyuan Ye and Jiazhi Jiang and Xu Chen and Dan Huang and Jiangsu Du and Yutong Lu},
  year = {2024},
  url = {https://ieeexplore.ieee.org/document/10546617},
  researchr = {https://researchr.org/publication/WeiYJCHDL24},
  cites = {0},
  citedby = {0},
  pages = {1-6},
  booktitle = {Design, Automation & Test in Europe Conference & Exhibition, DATE 2024, Valencia, Spain, March 25-27, 2024},
  publisher = {IEEE},
  isbn = {978-3-9819263-8-5},
}