Hao Ge, Fangcheng Fu, Haoyang Li, Xuanyu Wang, Sheng Lin, Yujie Wang, Xiaonan Nie, Hailin Zhang, Xupeng Miao, Bin Cui. Enabling Parallelism Hot Switching for Efficient Training of Large Language Models. In Emmett Witchel, Christopher J. Rossbach, Andrea C. Arpaci-Dusseau, Kimberly Keeton, editors, Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, SOSP 2024, Austin, TX, USA, November 4-6, 2024. Pages 178-194, ACM, 2024.