Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically

Jie Zhao 0002, Cédric Bastoul, Yanzhi Yi, Jiahui Hu, Wang Nie, Renwei Zhang, Zhen Geng, Chong Li, Thibaut Tachon, Zhiliang Gan. Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. pages 451-466, ACM, 2022. [doi]

Authors

Jie Zhao 0002

This author has not been identified. Look up 'Jie Zhao 0002' in Google

Cédric Bastoul

This author has not been identified. Look up 'Cédric Bastoul' in Google

Yanzhi Yi

This author has not been identified. Look up 'Yanzhi Yi' in Google

Jiahui Hu

This author has not been identified. Look up 'Jiahui Hu' in Google

Wang Nie

This author has not been identified. Look up 'Wang Nie' in Google

Renwei Zhang

This author has not been identified. Look up 'Renwei Zhang' in Google

Zhen Geng

This author has not been identified. Look up 'Zhen Geng' in Google

Chong Li

This author has not been identified. Look up 'Chong Li' in Google

Thibaut Tachon

This author has not been identified. Look up 'Thibaut Tachon' in Google

Zhiliang Gan

This author has not been identified. Look up 'Zhiliang Gan' in Google