Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan. MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 26396-26406, IEEE, 2024.