MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Ludan Ruan, Yiyang Ma, Huan Yang 0005, Huiguo He, Bei Liu 0001, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo. MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 10219-10228, IEEE, 2023. [doi]

Authors

Ludan Ruan

This author has not been identified. Look up 'Ludan Ruan' in Google

Yiyang Ma

This author has not been identified. Look up 'Yiyang Ma' in Google

Huan Yang 0005

This author has not been identified. Look up 'Huan Yang 0005' in Google

Huiguo He

This author has not been identified. Look up 'Huiguo He' in Google

Bei Liu 0001

This author has not been identified. Look up 'Bei Liu 0001' in Google

Jianlong Fu

This author has not been identified. Look up 'Jianlong Fu' in Google

Nicholas Jing Yuan

This author has not been identified. Look up 'Nicholas Jing Yuan' in Google

Qin Jin

This author has not been identified. Look up 'Qin Jin' in Google

Baining Guo

This author has not been identified. Look up 'Baining Guo' in Google