A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm

Ben Keller, Rangharajan Venkatesan, Steve Dai, Stephen G. Tell, Brian Zimmer, Charbel Sakr, William J. Dally, C. Thomas Gray, Brucek Khailany. A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm. J. Solid-State Circuits, 58(4):1129-1141, 2023. [doi]

Authors

Ben Keller

This author has not been identified. Look up 'Ben Keller' in Google

Rangharajan Venkatesan

This author has not been identified. Look up 'Rangharajan Venkatesan' in Google

Steve Dai

This author has not been identified. Look up 'Steve Dai' in Google

Stephen G. Tell

This author has not been identified. Look up 'Stephen G. Tell' in Google

Brian Zimmer

This author has not been identified. Look up 'Brian Zimmer' in Google

Charbel Sakr

This author has not been identified. Look up 'Charbel Sakr' in Google

William J. Dally

This author has not been identified. Look up 'William J. Dally' in Google

C. Thomas Gray

This author has not been identified. Look up 'C. Thomas Gray' in Google

Brucek Khailany

This author has not been identified. Look up 'Brucek Khailany' in Google