Xinkuang Geng, Siting Liu 0001, Hui Wang 0023, Jie Han 0001, Honglan Jiang. SA-ANT: Efficient Low-Bit Group-Wise Quantization for Large Language Models via Sign-Asymmetric Adaptive Numeric Type. In Design, Automation & Test in Europe Conference, DATE 2026, Verona, Italy, April 20-22, 2026. pages 1-7, IEEE, 2026. [doi]
Abstract is missing.