Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically

Jie Zhao 0002, Cédric Bastoul, Yanzhi Yi, Jiahui Hu, Wang Nie, Renwei Zhang, Zhen Geng, Chong Li, Thibaut Tachon, Zhiliang Gan. Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically. In Andreas Klöckner, José Moreira, editors, Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022. pages 451-466, ACM, 2022. [doi]

@inproceedings{ZhaoBYHNZGLTG22,
  title = {Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically},
  author = {Jie Zhao 0002 and Cédric Bastoul and Yanzhi Yi and Jiahui Hu and Wang Nie and Renwei Zhang and Zhen Geng and Chong Li and Thibaut Tachon and Zhiliang Gan},
  year = {2022},
  doi = {10.1145/3559009.3569656},
  url = {https://doi.org/10.1145/3559009.3569656},
  researchr = {https://researchr.org/publication/ZhaoBYHNZGLTG22},
  cites = {0},
  citedby = {0},
  pages = {451-466},
  booktitle = {Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, Chicago, Illinois, October 8-12, 2022},
  editor = {Andreas Klöckner and José Moreira},
  publisher = {ACM},
  isbn = {978-1-4503-9868-8},
}