SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information

Chih-Kai Yang, Neo Ho, Yen-Ting Piao, Hung-yi Lee. SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information. In Odette Scharenborg, Catharine Oertel, Khiet Truong, editors, 26th Annual Conference of the International Speech Communication Association, Interspeech 2025, Rotterdam, The Netherlands, 17-21 August 2025. ISCA, 2025. [doi]

Authors

Chih-Kai Yang

This author has not been identified. Look up 'Chih-Kai Yang' in Google

Neo Ho

This author has not been identified. Look up 'Neo Ho' in Google

Yen-Ting Piao

This author has not been identified. Look up 'Yen-Ting Piao' in Google

Hung-yi Lee

This author has not been identified. Look up 'Hung-yi Lee' in Google