Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs

Xueying Wang, Guangli Li, Xiao Dong, Jiansong Li, Lei Liu 0030, Xiaobing Feng 0002. Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs. In Maciej Malawski, Krzysztof Rzadca, editors, Euro-Par 2020: Parallel Processing - 26th International Conference on Parallel and Distributed Computing, Warsaw, Poland, August 24-28, 2020, Proceedings. Volume 12247 of Lecture Notes in Computer Science, pages 219-233, Springer, 2020. [doi]

Authors

Xueying Wang

This author has not been identified. Look up 'Xueying Wang' in Google

Guangli Li

This author has not been identified. Look up 'Guangli Li' in Google

Xiao Dong

This author has not been identified. Look up 'Xiao Dong' in Google

Jiansong Li

This author has not been identified. Look up 'Jiansong Li' in Google

Lei Liu 0030

This author has not been identified. Look up 'Lei Liu 0030' in Google

Xiaobing Feng 0002

This author has not been identified. Look up 'Xiaobing Feng 0002' in Google