Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk

Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu. Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024, pages 21021-21034. Association for Computational Linguistics, 2024.

Authors

Zhiyuan Zeng

Qipeng Guo

Xiaoran Liu

Zhangyue Yin

Wentao Shu

Mianqiu Huang

Bo Wang

Yunhua Zhou

Linlin Li

Qun Liu

Xipeng Qiu
