DSAMT: Dual-Source Aligned Multimodal Transformers for TextCaps

Chenyang Liao, Ruifang Liu, Sheng Gao. DSAMT: Dual-Source Aligned Multimodal Transformers for TextCaps. In 7th IEEE International Conference on Network Intelligence and Digital Content, IC-NIDC 2021, Beijing, China, November 17-19, 2021, pages 373-377. IEEE, 2021. doi:10.1109/IC-NIDC54101.2021.9660575

@inproceedings{LiaoLG21,
  title = {DSAMT: Dual-Source Aligned Multimodal Transformers for TextCaps},
  author = {Chenyang Liao and Ruifang Liu and Sheng Gao},
  year = {2021},
  doi = {10.1109/IC-NIDC54101.2021.9660575},
  url = {https://doi.org/10.1109/IC-NIDC54101.2021.9660575},
  researchr = {https://researchr.org/publication/LiaoLG21},
  cites = {0},
  citedby = {0},
  pages = {373--377},
  booktitle = {7th IEEE International Conference on Network Intelligence and Digital Content, IC-NIDC 2021, Beijing, China, November 17-19, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-0582-9},
}