Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models

Liang Li, Qingyuan Li 0001, Bo Zhang 0046, Xiangxiang Chu. Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2024, February 20-27, 2024, Vancouver, Canada. pages 18536-18544, AAAI Press, 2024.

Authors

Liang Li

Qingyuan Li 0001

Bo Zhang 0046

Xiangxiang Chu
