Accumulator-Aware Post-Training Quantization for Large Language Models

Ian Colbert, Giuseppe Franco, Fabian Grob, Jinjie Zhang, Rayan Saab. Accumulator-Aware Post-Training Quantization for Large Language Models. Trans. Mach. Learn. Res., 2025, 2025. [doi]

Abstract

Abstract is missing.