Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration

Zhongzhi Yu, Zheng Wang, Yonggan Fu, Huihong Shi, Khalid Shaikh, Yingyan Celine Lin. Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. pages 57659-57677, OpenReview.net, 2024.
