Improving system latency of AI accelerator with on-chip pipelined activation preprocessing and multi-mode batch inference

Wenxuan Chen, Zheng Wang, Ming Lei, Bo Dong, Zhuo Wang, Yongkui Yang, Chao Chen 0022, Weiyu Guo, Chen Liang, Qian Zhang, Wenqi Fang, Zhibin Yu. Improving system latency of AI accelerator with on-chip pipelined activation preprocessing and multi-mode batch inference. In 3rd IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2021, Washington, DC, USA, June 6-9, 2021. Pages 1-4, IEEE, 2021.

Authors

Wenxuan Chen

Zheng Wang

Ming Lei

Bo Dong

Zhuo Wang

Yongkui Yang

Chao Chen 0022

Weiyu Guo

Chen Liang

Qian Zhang

Wenqi Fang

Zhibin Yu
