LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism

Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Shuaiwen Leon Song, Ang Li, Martin C. Herbordt. LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism. In 30th IEEE International Conference on Application-specific Systems, Architectures and Processors, ASAP 2019, New York, NY, USA, July 15-17, 2019. pages 9-16, IEEE, 2019. [doi]

Abstract

Abstract is missing.