Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU

Zeyu Wang, Xiaoxi He, Zimu Zhou, Xu Wang 0018, Qiang Ma 0007, Xin Miao, Zhuo Liu, Lothar Thiele, Zheng Yang 0002. Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU. In 19th Annual IEEE International Conference on Sensing, Communication, and Networking, SECON 2022, Stockholm, Sweden, September 20-23, 2022. pages 145-153, IEEE, 2022. [doi]

Authors

Zeyu Wang

This author has not been identified. Look up 'Zeyu Wang' in Google

Xiaoxi He

This author has not been identified. Look up 'Xiaoxi He' in Google

Zimu Zhou

This author has not been identified. Look up 'Zimu Zhou' in Google

Xu Wang 0018

This author has not been identified. Look up 'Xu Wang 0018' in Google

Qiang Ma 0007

This author has not been identified. Look up 'Qiang Ma 0007' in Google

Xin Miao

This author has not been identified. Look up 'Xin Miao' in Google

Zhuo Liu

This author has not been identified. Look up 'Zhuo Liu' in Google

Lothar Thiele

This author has not been identified. Look up 'Lothar Thiele' in Google

Zheng Yang 0002

This author has not been identified. Look up 'Zheng Yang 0002' in Google