Tomás Soucek, Dima Damen, Michael Wray, Ivan Laptev, Josef Sivic. GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 6561-6571, IEEE, 2024. [doi]
Abstract is missing.