Separate and Locate: Rethink the Text in Text-based Visual Question Answering

Chengyang Fang, Jiangnan Li, Liang Li, Can Ma, Dayong Hu. Separate and Locate: Rethink the Text in Text-based Visual Question Answering. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October - 3 November 2023, pages 4378-4388. ACM, 2023.

Authors

Chengyang Fang


Jiangnan Li


Liang Li


Can Ma


Dayong Hu
