The following publications are possibly variants of this publication:
- Audio-Driven Talking Head Video Generation with Diffusion ModelYizhe Zhu, Chunhui Zhang, Qiong Liu, Xi Zhou. icassp 2023: 1-5 [doi]
- Expressive Talking Head Generation with Granular Audio-Visual ControlBorong Liang, Yan Pan, Zhizhi Guo, Hang Zhou, Zhibin Hong, Xiaoguang Han, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang 0001. cvpr 2022: 3377-3386 [doi]
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid PriorXusen Sun, Longhao Zhang, Hao Zhu 0004, Peng Zhang 0080, Bang Zhang, Xinya Ji, Kangneng Zhou, Daiheng Gao, Liefeng Bo, Xun Cao. 3dim 2025: 713-722 [doi]
- EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video DiffusionHaotian Wang, Yuzhe Weng, Yueyan Li, Zilu Guo, Jun Du, Shutong Niu, Jiefeng Ma, Shan He, Xiaoyan Wu, Qiming Hu, Bing Yin, Cong Liu, Qingfeng Liu. cvpr 2025: 26212-26221 [doi]
- Flow2Flow: Audio-visual cross-modality generation for talking face videos with rhythmic headZhangjing Wang, Wenzhi He, Yujiang Wei, Yupeng Luo. displays, 80:102552, December 2023. [doi]