A training-inference consistent framework for early exiting in language models with parallel decoding

Ziqian Zeng, Zelin Chen, Huiping Zhuang. A training-inference consistent framework for early exiting in language models with parallel decoding. Int. J. Machine Learning & Cybernetics, 17(3):121, March 2026. [doi]

Abstract

Abstract is missing.