Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution

Qihao Liu, Xi Yin 0001, Alan L. Yuille, Andrew Brown, Mannat Singh. Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 2755-2765, Computer Vision Foundation / IEEE, 2025. [doi]

Abstract

Abstract is missing.