Yanxi Chen, Xuchen Pan, Yaliang Li, Bolin Ding, Jingren Zhou. EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]
Abstract is missing.