Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information

Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai. Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023, pages 15888-15899. IEEE, 2023.

@inproceedings{0002ZTLLHQWZD23,
  title = {Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information},
  author = {Weijie Su and Xizhou Zhu and Chenxin Tao and Lewei Lu and Bin Li and Gao Huang and Yu Qiao and Xiaogang Wang and Jie Zhou and Jifeng Dai},
  year = {2023},
  doi = {10.1109/CVPR52729.2023.01525},
  url = {https://doi.org/10.1109/CVPR52729.2023.01525},
  researchr = {https://researchr.org/publication/0002ZTLLHQWZD23},
  cites = {0},
  citedby = {0},
  pages = {15888--15899},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0129-8},
}