FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models

Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, Fuwen Luo, Shangfeng Shi, Qin Li. FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models. In Jaejin Lee, Kunal Agrawal, Michael F. Spear, editors, PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022. pages 120-134, ACM, 2022. [doi]

Authors

Jiaao He

This author has not been identified. Look up 'Jiaao He' in Google

Jidong Zhai

This author has not been identified. Look up 'Jidong Zhai' in Google

Tiago Antunes

This author has not been identified. Look up 'Tiago Antunes' in Google

Haojie Wang

This author has not been identified. Look up 'Haojie Wang' in Google

Fuwen Luo

This author has not been identified. Look up 'Fuwen Luo' in Google

Shangfeng Shi

This author has not been identified. Look up 'Shangfeng Shi' in Google

Qin Li

This author has not been identified. Look up 'Qin Li' in Google