The following publications are possibly variants of this publication:
- Cascaded Diffusion Models for High Fidelity Image GenerationJonathan Ho, Chitwan Saharia, William Chan, David J. Fleet, Mohammad Norouzi 0002, Tim Salimans. jmlr, 23, 2022. [doi]
- VideoFusion: Decomposed Diffusion Models for High-Quality Video GenerationZhengxiong Luo, Dayou Chen, Yingya Zhang, Yan Huang, Liang Wang, Yujun Shen, Deli Zhao, Jingren Zhou, Tieniu Tan. cvpr 2023: 10209-10218 [doi]
- Align Your Latents: High-Resolution Video Synthesis with Latent Diffusion ModelsAndreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim 0001, Sanja Fidler, Karsten Kreis. cvpr 2023: 22563-22575 [doi]
- DrivingGen: Efficient Safety-Critical Driving Video Generation with Latent Diffusion ModelsZipeng Guo, Yuchen Zhou, Chao Gou. icmcs 2024: 1-6 [doi]
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video GenerationDavid Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao 0001, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou. ijcv, 133(4):1879-1893, April 2025. [doi]
- MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video GenerationMingzhen Sun, Weining Wang, Yanyuan Qiao, Jiahui Sun, Zihan Qin, Longteng Guo, Xinxin Zhu, Jing Liu 0001. mm 2024: 10853-10861 [doi]