Multi-Modal Learning with Text Merging for TEXTVQA

Changsheng Xu, Zhenlong Xu, Yifan He, Shuigeng Zhou, Jihong Guan. Multi-Modal Learning with Text Merging for TEXTVQA. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. pages 1985-1989, IEEE, 2022. [doi]

Authors

Changsheng Xu

This author has not been identified. Look up 'Changsheng Xu' in Google

Zhenlong Xu

This author has not been identified. Look up 'Zhenlong Xu' in Google

Yifan He

This author has not been identified. Look up 'Yifan He' in Google

Shuigeng Zhou

This author has not been identified. Look up 'Shuigeng Zhou' in Google

Jihong Guan

This author has not been identified. Look up 'Jihong Guan' in Google