Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference

Benjamin Hawks, Javier M. Duarte, Nicholas J. Fraser, Alessandro Pappalardo, Nhan Tran, Yaman Umuroglu. Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference. Frontiers Artif. Intell., 4:676564, 2021. [doi]

Authors

Benjamin Hawks

This author has not been identified. Look up 'Benjamin Hawks' in Google

Javier M. Duarte

This author has not been identified. Look up 'Javier M. Duarte' in Google

Nicholas J. Fraser

This author has not been identified. Look up 'Nicholas J. Fraser' in Google

Alessandro Pappalardo

This author has not been identified. Look up 'Alessandro Pappalardo' in Google

Nhan Tran

This author has not been identified. Look up 'Nhan Tran' in Google

Yaman Umuroglu

This author has not been identified. Look up 'Yaman Umuroglu' in Google