Training of deep learning pipelines on memory-constrained GPUs via segmented fused-tiled execution

Yufan Xu, Saurabh Raje, Atanas Rountev, Gerald Sabin, Aravind Sukumaran-Rajam, P. Sadayappan. Training of deep learning pipelines on memory-constrained GPUs via segmented fused-tiled execution. In Bernhard Egger, Aaron Smith, editors, CC '22: 31st ACM SIGPLAN International Conference on Compiler Construction, Seoul, South Korea, April 2-3, 2022, pages 104-116. ACM, 2022.

Authors

Yufan Xu

Saurabh Raje

Atanas Rountev

Gerald Sabin

Aravind Sukumaran-Rajam

P. Sadayappan
