QuIP: 2-Bit Quantization of Large Language Models With Guarantees

Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa. QuIP: 2-Bit Quantization of Large Language Models With Guarantees. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Jerry Chee

This author has not been identified. Look up 'Jerry Chee' in Google

Yaohui Cai

This author has not been identified. Look up 'Yaohui Cai' in Google

Volodymyr Kuleshov

This author has not been identified. Look up 'Volodymyr Kuleshov' in Google

Christopher De Sa

This author has not been identified. Look up 'Christopher De Sa' in Google