Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically

Jie Zhao, Cédric Bastoul, Yanzhi Yi, Jiahui Hu, Wang Nie, Renwei Zhang, Zhen Geng, Chong Li, Thibaut Tachon, Zhiliang Gan. Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. Pages 451-466, ACM, 2022.
