Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Guan-Ting Lin, Prashanth Gurunath Shivakumar, Aditya Gourav, Yile Gu, Ankur Gandhe, Hung-yi Lee, Ivan Bulyko. Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar, editors, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2025, Vienna, Austria, July 27 - August 1, 2025. pages 20395-20411, Association for Computational Linguistics, 2025. [doi]

Authors

Guan-Ting Lin

This author has not been identified. Look up 'Guan-Ting Lin' in Google

Prashanth Gurunath Shivakumar

This author has not been identified. Look up 'Prashanth Gurunath Shivakumar' in Google

Aditya Gourav

This author has not been identified. Look up 'Aditya Gourav' in Google

Yile Gu

This author has not been identified. Look up 'Yile Gu' in Google

Ankur Gandhe

This author has not been identified. Look up 'Ankur Gandhe' in Google

Hung-yi Lee

This author has not been identified. Look up 'Hung-yi Lee' in Google

Ivan Bulyko

This author has not been identified. Look up 'Ivan Bulyko' in Google