Wen Cheng, Shichen Dong, Jiayu Qin, Wei Wang. QAQ: Quality Adaptive Quantization for LLM KV Cache. In IEEE/CVF International Conference on Computer Vision, ICCV 2025 - Workshops, Honolulu, HI, USA, October 19-20, 2025. Pages 2563-2571, IEEE, 2025.