OECA-Net: A co-attention network for visual question answering based on OCR scene text feature enhancement

Feng Yan, Wushouer Silamu, Yachuang Chai, Yanbing Li. OECA-Net: A co-attention network for visual question answering based on OCR scene text feature enhancement. Multimedia Tools Appl., 83(3):7085-7096, January 2024. [doi]

@article{YanSCL24,
  title = {OECA-Net: A co-attention network for visual question answering based on OCR scene text feature enhancement},
  author = {Feng Yan and Wushouer Silamu and Yachuang Chai and Yanbing Li},
  year = {2024},
  month = {January},
  doi = {10.1007/s11042-023-15418-6},
  url = {https://doi.org/10.1007/s11042-023-15418-6},
  researchr = {https://researchr.org/publication/YanSCL24},
  cites = {0},
  citedby = {0},
  journal = {Multimedia Tools Appl.},
  volume = {83},
  number = {3},
  pages = {7085-7096},
}