Unifying 2D and 3D Vision-Language Understanding

Ayush Jain, Alexander Swerdlow, Yuzhou Wang, Sergio Arnaud, Ada Martin, Alexander Sax, Franziska Meier, Katerina Fragkiadaki. Unifying 2D and 3D Vision-Language Understanding. In Forty-second International Conference on Machine Learning, ICML 2025, Vancouver, BC, Canada, July 13-19, 2025. OpenReview.net, 2025. [doi]

Authors

Ayush Jain

This author has not been identified. Look up 'Ayush Jain' in Google

Alexander Swerdlow

This author has not been identified. Look up 'Alexander Swerdlow' in Google

Yuzhou Wang

This author has not been identified. Look up 'Yuzhou Wang' in Google

Sergio Arnaud

This author has not been identified. Look up 'Sergio Arnaud' in Google

Ada Martin

This author has not been identified. Look up 'Ada Martin' in Google

Alexander Sax

This author has not been identified. Look up 'Alexander Sax' in Google

Franziska Meier

This author has not been identified. Look up 'Franziska Meier' in Google

Katerina Fragkiadaki

This author has not been identified. Look up 'Katerina Fragkiadaki' in Google