Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU

Zeyu Wang, Xiaoxi He, Zimu Zhou, Xu Wang 0018, Qiang Ma 0007, Xin Miao, Zhuo Liu, Lothar Thiele, Zheng Yang 0002. Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU. In 19th Annual IEEE International Conference on Sensing, Communication, and Networking, SECON 2022, Stockholm, Sweden, September 20-23, 2022. pages 145-153, IEEE, 2022. [doi]

@inproceedings{WangHZ00MLT022,
  title = {Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU},
  author = {Zeyu Wang and Xiaoxi He and Zimu Zhou and Xu Wang 0018 and Qiang Ma 0007 and Xin Miao and Zhuo Liu and Lothar Thiele and Zheng Yang 0002},
  year = {2022},
  doi = {10.1109/SECON55815.2022.9918563},
  url = {https://doi.org/10.1109/SECON55815.2022.9918563},
  researchr = {https://researchr.org/publication/WangHZ00MLT022},
  cites = {0},
  citedby = {0},
  pages = {145-153},
  booktitle = {19th Annual IEEE International Conference on Sensing, Communication, and Networking, SECON 2022, Stockholm, Sweden, September 20-23, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-8643-9},
}