PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Dawei Zhu, Nan Yang 0002, Liang Wang 0046, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li. PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training. In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024. [doi]

Abstract

Abstract is missing.