GWQ: Group-Wise Quantization Framework for Neural Networks

Jiaming Yang, Chenwei Tang, Caiyang Yu, Jiancheng Lv. GWQ: Group-Wise Quantization Framework for Neural Networks. In Berrin Yanikoglu, Wray L. Buntine, editors, Asian Conference on Machine Learning, 11-14 November 2023, Istanbul, Turkey. Volume 222 of Proceedings of Machine Learning Research, pages 1526-1541. PMLR, 2023.
