Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo, Kwonjoon Lee. Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024, pages 18580-18590. IEEE, 2024.
