HGP4CNN: an efficient parallelization framework for training convolutional neural networks on modern GPUs

Hao Fu, Shanjiang Tang, Bingsheng He, Ce Yu, Jizhou Sun. HGP4CNN: an efficient parallelization framework for training convolutional neural networks on modern GPUs. The Journal of Supercomputing, 77(11):12741-12770, 2021. [doi]

Abstract

Abstract is missing.