Zifei Xu, Sayeh Sharify, Wanzin Yazar, Tristan Webb, Xin Wang. Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs. In International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025. Pages 1-8, IEEE, 2025.