Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically

Jie Zhao, Cédric Bastoul, Yanzhi Yi, Jiahui Hu, Wang Nie, Renwei Zhang, Zhen Geng, Chong Li, Thibaut Tachon, Zhiliang Gan. Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. Pages 451-466, ACM, 2022.
