Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning

Xu Yang, Hanwang Zhang, Chongyang Gao, Jianfei Cai 0001. Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning. International Journal of Computer Vision, 131(1):82-100, 2023. [doi]

@article{YangZGC23,
  title = {Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning},
  author = {Xu Yang and Hanwang Zhang and Chongyang Gao and Jianfei Cai 0001},
  year = {2023},
  doi = {10.1007/s11263-022-01692-8},
  url = {https://doi.org/10.1007/s11263-022-01692-8},
  researchr = {https://researchr.org/publication/YangZGC23},
  cites = {0},
  citedby = {0},
  journal = {International Journal of Computer Vision},
  volume = {131},
  number = {1},
  pages = {82-100},
}