GPU memory usage optimization for backward propagation in deep network training

Ding-Yong Hong, Tzu-Hsien Tsai, Ning Wang, Pangfeng Liu, Jan-Jan Wu. GPU memory usage optimization for backward propagation in deep network training. J. Parallel Distrib. Comput., 199:105053, 2025.

Abstract

Abstract is missing.