Hearing Faces: Target Speaker Text-to-Speech Synthesis from a Face

Björn Plüster, Cornelius Weber, Leyuan Qu, Stefan Wermter. Hearing Faces: Target Speaker Text-to-Speech Synthesis from a Face. In IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021, Cartagena, Colombia, December 13-17, 2021. Pages 757-764, IEEE, 2021.
