Scene-text Oriented Visual Entailment: Task, Dataset and Solution

Nan Li, Pijian Li, Dongsheng Xu, Wenye Zhao, Yi Cai, Qingbao Huang. Scene-text Oriented Visual Entailment: Task, Dataset and Solution. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 5562-5571, ACM, 2023. [doi]

Abstract

Abstract is missing.