Swinv2-Imagen: hierarchical vision transformer diffusion models for text-to-image generation

Ruijun Li, Weihua Li, Yi Yang, Hanyu Wei, Jianhua Jiang, Quan Bai. Swinv2-Imagen: hierarchical vision transformer diffusion models for text-to-image generation. Neural Computing and Applications, 36(28):17245-17260, October 2024.
