Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang. An Audio-Textual Diffusion Model for Converting Speech Signals into Ultrasound Tongue Imaging Data. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024, pages 2170-2174. IEEE, 2024.