Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models

Bella Septina Ika Hartanti, Dipta Tanaya, Kurniawati Azizah, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti. Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models. In 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2023, Delhi, India, December 4-6, 2023. pages 1-6, IEEE, 2023. [doi]

Abstract

Abstract is missing.