EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism

Yanxi Chen, Xuchen Pan, Yaliang Li, Bolin Ding, Jingren Zhou. EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024.

Authors

Yanxi Chen

Xuchen Pan

Yaliang Li

Bolin Ding

Jingren Zhou
