Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs

Xueying Wang, Guangli Li, Xiao Dong, Jiansong Li, Lei Liu, Xiaobing Feng. Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs. In Maciej Malawski, Krzysztof Rzadca, editors, Euro-Par 2020: Parallel Processing - 26th International Conference on Parallel and Distributed Computing, Warsaw, Poland, August 24-28, 2020, Proceedings. Volume 12247 of Lecture Notes in Computer Science, pages 219-233, Springer, 2020.

Abstract

Abstract is missing.