Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar. Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. In Yi Yang, Aida Davani, Avi Sil, Anoop Kumar, editors, Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, NAACL 2024, Mexico City, Mexico, June 16-21, 2024. pages 315-323, Association for Computational Linguistics, 2024. [doi]
Abstract is missing.