Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion

Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer. Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 1772-1776, ISCA, 2021. [doi]

Authors

Duc Le

This author has not been identified. Look up 'Duc Le' in Google

Mahaveer Jain

This author has not been identified. Look up 'Mahaveer Jain' in Google

Gil Keren

This author has not been identified. Look up 'Gil Keren' in Google

Suyoun Kim

This author has not been identified. Look up 'Suyoun Kim' in Google

Yangyang Shi

This author has not been identified. Look up 'Yangyang Shi' in Google

Jay Mahadeokar

This author has not been identified. Look up 'Jay Mahadeokar' in Google

Julian Chan

This author has not been identified. Look up 'Julian Chan' in Google

Yuan Shangguan

This author has not been identified. Look up 'Yuan Shangguan' in Google

Christian Fuegen

This author has not been identified. Look up 'Christian Fuegen' in Google

Ozlem Kalinli

This author has not been identified. Look up 'Ozlem Kalinli' in Google

Yatharth Saraf

This author has not been identified. Look up 'Yatharth Saraf' in Google

Michael L. Seltzer

This author has not been identified. Look up 'Michael L. Seltzer' in Google