Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management

Jiarui Fang, Zilin Zhu, Shenggui Li, Hui Su, Yang Yu, Jie Zhou, Yang You. Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management. IEEE Trans. Parallel Distrib. Syst., 34(1):304-315, 2023. [doi]

Abstract

Abstract is missing.