Joonghyuk Shin, Alchan Hwang, Yujin Kim, Daneul Kim, Jaesik Park. Exploring Multimodal Diffusion Transformers for Enhanced Prompt-Based Image Editing. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025, pages 19492-19502. IEEE, 2025.