Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar. Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. In Yi Yang, Aida Davani, Avi Sil, Anoop Kumar, editors, Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, NAACL 2024, Mexico City, Mexico, June 16-21, 2024. pages 315-323, Association for Computational Linguistics, 2024.

Authors

Zelin Wu

Gan Song

Christopher Li

Pat Rondon

Zhong Meng

Xavier Velez

Weiran Wang

Diamantino Caseiro

Golan Pundak

Tsendsuren Munkhdalai

Angad Chandorkar

Rohit Prabhavalkar
