VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

Steve Dai, Rangharajan Venkatesan, Mark Ren, Brian Zimmer, William J. Dally, Brucek Khailany. VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference. In Alex Smola, Alex Dimakis, Ion Stoica, editors, Proceedings of Machine Learning and Systems 2021, MLSys 2021, virtual, April 5-9, 2021. mlsys.org, 2021. [doi]

@inproceedings{DaiVRZDK21,
  title = {VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference},
  author = {Steve Dai and Rangharajan Venkatesan and Mark Ren and Brian Zimmer and William J. Dally and Brucek Khailany},
  year = {2021},
  url = {https://proceedings.mlsys.org/paper/2021/hash/f0935e4cd5920aa6c7c996a5ee53a70f-Abstract.html},
  researchr = {https://researchr.org/publication/DaiVRZDK21},
  cites = {0},
  citedby = {0},
  booktitle = {Proceedings of Machine Learning and Systems 2021, MLSys 2021, virtual, April 5-9, 2021},
  editor = {Alex Smola and Alex Dimakis and Ion Stoica},
  publisher = {mlsys.org},
}