M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training

Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang 0001, Nan Duan. M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 3977-3986, Computer Vision Foundation / IEEE, 2021.

Authors

Minheng Ni

Haoyang Huang

Lin Su

Edward Cui

Taroon Bharti

Lijuan Wang

Dongdong Zhang 0001

Nan Duan
