Scalable Low-Latency Persistent Neural Machine Translation on CPU Server with Multiple FPGAs

Eriko Nurvitadhi, Mishali Naik, Andrew Boutros, Prerna Budhkar, Ali Jafari, Dongup Kwon, David Sheffield, Abirami Prabhakaran, Karthik Gururaj, Pranavi Appana. Scalable Low-Latency Persistent Neural Machine Translation on CPU Server with Multiple FPGAs. In International Conference on Field-Programmable Technology, ICFPT 2019, Tianjin, China, December 9-13, 2019. pages 307-310, IEEE, 2019. [doi]

Abstract

Abstract is missing.