Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

Cong-Duy Nguyen, The-Anh Vu-Le, Thong Nguyen, Tho Quan, Anh Tuan Luu. Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 5665-5673, ACM, 2023. [doi]

@inproceedings{NguyenVNQL23,
  title = {Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment},
  author = {Cong-Duy Nguyen and The-Anh Vu-Le and Thong Nguyen and Tho Quan and Anh Tuan Luu},
  year = {2023},
  doi = {10.1145/3581783.3612248},
  url = {https://doi.org/10.1145/3581783.3612248},
  researchr = {https://researchr.org/publication/NguyenVNQL23},
  cites = {0},
  citedby = {0},
  pages = {5665-5673},
  booktitle = {Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023},
  editor = {Abdulmotaleb El-Saddik and Tao Mei and Rita Cucchiara and Marco Bertini 0001 and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain},
  publisher = {ACM},
}