Prompting Large Language Models with Speech Recognition Abilities

Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer. Prompting Large Language Models with Speech Recognition Abilities. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. pages 13351-13355, IEEE, 2024. [doi]

Authors

Yassir Fathullah

This author has not been identified. Look up 'Yassir Fathullah' in Google

Chunyang Wu

This author has not been identified. Look up 'Chunyang Wu' in Google

Egor Lakomkin

This author has not been identified. Look up 'Egor Lakomkin' in Google

Junteng Jia

This author has not been identified. Look up 'Junteng Jia' in Google

Yuan Shangguan

This author has not been identified. Look up 'Yuan Shangguan' in Google

Ke Li

This author has not been identified. Look up 'Ke Li' in Google

Jinxi Guo

This author has not been identified. Look up 'Jinxi Guo' in Google

Wenhan Xiong

This author has not been identified. Look up 'Wenhan Xiong' in Google

Jay Mahadeokar

This author has not been identified. Look up 'Jay Mahadeokar' in Google

Ozlem Kalinli

This author has not been identified. Look up 'Ozlem Kalinli' in Google

Christian Fuegen

This author has not been identified. Look up 'Christian Fuegen' in Google

Mike Seltzer

This author has not been identified. Look up 'Mike Seltzer' in Google