Junchen Lu, Berrak Sisman, Mingyang Zhang 0003, Haizhou Li 0001. High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. In Naomi Harte, Julie Carson-Berndsen, Gareth Jones, editors, 24th Annual Conference of the International Speech Communication Association, Interspeech 2023, Dublin, Ireland, August 20-24, 2023. pages 5536-5540, ISCA, 2023. [doi]