Soon Yau Cheong, Armin Mustafa, Andrew Gilbert. [inline-graphic not available: see fulltext] : Bridging and Harmonizing [inline-graphic not available: see fulltext] and Textual Conditioning for [inline-graphic not available: see fulltext]. In Alessio Del Bue, Cristian Canton, Jordi Pont-Tuset, Tatiana Tommasi, editors, Computer Vision - ECCV 2024 Workshops - Milan, Italy, September 29-October 4, 2024, Proceedings, Part I. Volume 15623 of Lecture Notes in Computer Science, pages 267-285, Springer, 2024. [doi]
Abstract is missing.