Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset

Wentao Mo, Qingchao Chen, Yuxin Peng 0001, Siyuan Huang, Yang Liu 0105. Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset. In Cathal Gurrin, Klaus Schoeffmann, Min Zhang, Luca Rossetto, Stevan Rudinac, Duc-Tien Dang-Nguyen, Wen-Huang Cheng, Phoebe Chen, Jenny Benois-Pineau, editors, Proceedings of the 33rd ACM International Conference on Multimedia, MM 2025, Dublin, Ireland, October 27-31, 2025. pages 12973-12980, ACM, 2025. [doi]

Authors

Wentao Mo

This author has not been identified. Look up 'Wentao Mo' in Google

Qingchao Chen

This author has not been identified. Look up 'Qingchao Chen' in Google

Yuxin Peng 0001

This author has not been identified. Look up 'Yuxin Peng 0001' in Google

Siyuan Huang

This author has not been identified. Look up 'Siyuan Huang' in Google

Yang Liu 0105

This author has not been identified. Look up 'Yang Liu 0105' in Google