Haotong Qin, Xudong Ma, Xingyu Zheng, Xiaoyang Li, Yang Zhang 0088, Shouda Liu, Jie Luo 0004, Xianglong Liu 0001, Michele Magno. Accurate LoRA-Finetuning Quantization of LLMs via Information Retention. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]
Abstract is missing.