Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation

Yu Zhao, Jianguo Wei, Zhichao Lin, Yueheng Sun, Meishan Zhang, Min Zhang 0005. Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation. In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11. pages 1437-1449, Association for Computational Linguistics, 2022. [doi]

Abstract

Abstract is missing.