GWQ: Group-Wise Quantization Framework for Neural Networks

Jiaming Yang, Chenwei Tang, Caiyang Yu, Jiancheng Lv 0001. GWQ: Group-Wise Quantization Framework for Neural Networks. In Berrin Yanikoglu, Wray L. Buntine, editors, Asian Conference on Machine Learning, 11-14 November 2023, Istanbul, Turkey. Volume 222 of Proceedings of Machine Learning Research, pages 1526-1541, PMLR, 2023. [doi]

@inproceedings{YangTY023,
  title = {GWQ: Group-Wise Quantization Framework for Neural Networks},
  author = {Jiaming Yang and Chenwei Tang and Caiyang Yu and Jiancheng Lv 0001},
  year = {2023},
  url = {https://proceedings.mlr.press/v222/yang24a.html},
  researchr = {https://researchr.org/publication/YangTY023},
  cites = {0},
  citedby = {0},
  pages = {1526-1541},
  booktitle = {Asian Conference on Machine Learning, 11-14 November 2023, Istanbul, Turkey},
  editor = {Berrin Yanikoglu and Wray L. Buntine},
  volume = {222},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}