Finetuning Language Models for Multimodal Question Answering

Xin Zhang, Wen Xie, Ziqi Dai, Jun Rao, Haokun Wen, Xuan Luo, Meishan Zhang, Min Zhang. Finetuning Language Models for Multimodal Question Answering. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 9420-9424, ACM, 2023. [doi]

@inproceedings{ZhangXDRWLZZ23,
  title = {Finetuning Language Models for Multimodal Question Answering},
  author = {Xin Zhang and Wen Xie and Ziqi Dai and Jun Rao and Haokun Wen and Xuan Luo and Meishan Zhang and Min Zhang},
  year = {2023},
  doi = {10.1145/3581783.3612837},
  url = {https://doi.org/10.1145/3581783.3612837},
  researchr = {https://researchr.org/publication/ZhangXDRWLZZ23},
  cites = {0},
  citedby = {0},
  pages = {9420-9424},
  booktitle = {Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023},
  editor = {Abdulmotaleb El-Saddik and Tao Mei and Rita Cucchiara and Marco Bertini 0001 and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain},
  publisher = {ACM},
}