Mitigating Quantization Errors Due to Activation Spikes in Gated Linear Unit-Based Large Language Models

Jaewoo Yang, Hayun Kim, Junyung Ji, Younghoon Kim. Mitigating Quantization Errors Due to Activation Spikes in Gated Linear Unit-Based Large Language Models. Future Internet, 17(4):185, 2025. [doi]

Abstract

Abstract is missing.