Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks

Julieta Martinez, Jashan Shewakramani, Ting-Wei Liu, Ioan Andrei Barsan, Wenyuan Zeng, Raquel Urtasun. Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 15699-15708, Computer Vision Foundation / IEEE, 2021. [doi]

Abstract

Abstract is missing.