EgoLM: Multi-Modal Language Model of Egocentric Motions

Fangzhou Hong, Vladimir Guzov, Hyo-Jin Kim, Yuting Ye, Richard A. Newcombe, Ziwei Liu, Lingni Ma. EgoLM: Multi-Modal Language Model of Egocentric Motions. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 5344-5354, Computer Vision Foundation / IEEE, 2025. [doi]

Abstract

Abstract is missing.