LightSeq: A High Performance Inference Library for Transformers

Xiaohui Wang, Ying Xiong, Yang Wei, Mingxuan Wang, Lei Li. LightSeq: A High Performance Inference Library for Transformers. In Young-Bum Kim, Yunyao Li, Owen Rambow, editors, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, NAACL-HLT 2021, Online, June 6-11, 2021, pages 113-120. Association for Computational Linguistics, 2021.
