Jing Liu, Ruihao Gong, Xiuying Wei, Zhiwei Dong, Jianfei Cai, Bohan Zhuang. QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models. In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024.