Mix-GEMM: An efficient HW-SW Architecture for Mixed-Precision Quantized Deep Neural Networks Inference on Edge Devices

Enrico Reggiani, Alessandro Pappalardo, Max Doblas, Miquel Moretó, Mauro Olivieri, Osman Sabri Unsal, Adrián Cristal. Mix-GEMM: An efficient HW-SW Architecture for Mixed-Precision Quantized Deep Neural Networks Inference on Edge Devices. In IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023, Montreal, QC, Canada, February 25 - March 1, 2023. pages 1085-1098, IEEE, 2023. [doi]

Authors

Enrico Reggiani

This author has not been identified. Look up 'Enrico Reggiani' in Google

Alessandro Pappalardo

This author has not been identified. Look up 'Alessandro Pappalardo' in Google

Max Doblas

This author has not been identified. Look up 'Max Doblas' in Google

Miquel Moretó

This author has not been identified. Look up 'Miquel Moretó' in Google

Mauro Olivieri

This author has not been identified. Look up 'Mauro Olivieri' in Google

Osman Sabri Unsal

This author has not been identified. Look up 'Osman Sabri Unsal' in Google

Adrián Cristal

This author has not been identified. Look up 'Adrián Cristal' in Google