AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang 0001, Mark D. Plumbley. AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 21450-21474, PMLR, 2023. [doi]

Authors

Haohe Liu

This author has not been identified. Look up 'Haohe Liu' in Google

Zehua Chen

This author has not been identified. Look up 'Zehua Chen' in Google

Yi Yuan

This author has not been identified. Look up 'Yi Yuan' in Google

Xinhao Mei

This author has not been identified. Look up 'Xinhao Mei' in Google

Xubo Liu

This author has not been identified. Look up 'Xubo Liu' in Google

Danilo P. Mandic

This author has not been identified. Look up 'Danilo P. Mandic' in Google

Wenwu Wang 0001

This author has not been identified. Look up 'Wenwu Wang 0001' in Google

Mark D. Plumbley

This author has not been identified. Look up 'Mark D. Plumbley' in Google