VIT-LENS: Towards Omni-modal Representations

Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou. VIT-LENS: Towards Omni-modal Representations. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 26637-26647, IEEE, 2024. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.