The following publications are possibly variants of this publication:
- DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion ModelsSicheng Yang, Zhiyong Wu 0001, Minglei Li, Zhensong Zhang, Lei Hao, Weihong Bao, Ming Cheng, Long Xiao. IJCAI 2023: 5860-5868 [doi]
- Audio-Driven Talking Head Video Generation with Diffusion ModelYizhe Zhu, Chunhui Zhang, Qiong Liu, Xi Zhou. icassp 2023: 1-5 [doi]
- Text-Driven Synchronized Diffusion Video and Audio Talking Head GenerationZhenfei Zhang, Tsung-Wei Huang, Guan-Ming Su, Ming-Ching Chang, Xin Li. miproBIS 2024: 61-67 [doi]
- Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion ModelXu He, Qiaochu Huang, Zhensong Zhang, Zhiwei Lin, Zhiyong Wu 0001, Sicheng Yang, Minglei Li 0001, Zhiyi Chen, Songcen Xu, Xiaofei Wu. cvpr 2024: 2263-2273 [doi]
- Audio-Driven Talking Face Video Generation With Dynamic Convolution KernelsZipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guo-Xin Zhang, Yong-Jin Liu. tmm, 25:2033-2046, 2023. [doi]