Eriko Nurvitadhi, Mishali Naik, Andrew Boutros, Prerna Budhkar, Ali Jafari, Dongup Kwon, David Sheffield, Abirami Prabhakaran, Karthik Gururaj, Pranavi Appana. Scalable Low-Latency Persistent Neural Machine Translation on CPU Server with Multiple FPGAs. In International Conference on Field-Programmable Technology, ICFPT 2019, Tianjin, China, December 9-13, 2019. pages 307-310, IEEE, 2019. [doi]
Abstract is missing.