VIT-LENS: Towards Omni-modal Representations

Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou. VIT-LENS: Towards Omni-modal Representations. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 26637-26647, IEEE, 2024. [doi]

@inproceedings{LeiGYZGSGSS24,
  title = {VIT-LENS: Towards Omni-modal Representations},
  author = {Weixian Lei and Yixiao Ge and Kun Yi and Jianfeng Zhang and Difei Gao and Dylan Sun and Yuying Ge and Ying Shan and Mike Zheng Shou},
  year = {2024},
  doi = {10.1109/CVPR52733.2024.02516},
  url = {https://doi.org/10.1109/CVPR52733.2024.02516},
  researchr = {https://researchr.org/publication/LeiGYZGSGSS24},
  cites = {0},
  citedby = {0},
  pages = {26637-26647},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-5300-6},
}