M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training

Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang 0001, Nan Duan. M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 3977-3986, Computer Vision Foundation / IEEE, 2021. [doi]

@inproceedings{NiHSCBW0D21,
  title = {M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training},
  author = {Minheng Ni and Haoyang Huang and Lin Su and Edward Cui and Taroon Bharti and Lijuan Wang and Dongdong Zhang 0001 and Nan Duan},
  year = {2021},
  url = {https://openaccess.thecvf.com/content/CVPR2021/html/Ni_M3P_Learning_Universal_Representations_via_Multitask_Multilingual_Multimodal_Pre-Training_CVPR_2021_paper.html},
  researchr = {https://researchr.org/publication/NiHSCBW0D21},
  cites = {0},
  citedby = {0},
  pages = {3977-3986},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021},
  publisher = {Computer Vision Foundation / IEEE},
}