GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos

Tomás Soucek, Dima Damen, Michael Wray, Ivan Laptev, Josef Sivic. GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 6561-6571, IEEE, 2024. [doi]

@inproceedings{SoucekDWLS24,
  title = {GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos},
  author = {Tomás Soucek and Dima Damen and Michael Wray and Ivan Laptev and Josef Sivic},
  year = {2024},
  doi = {10.1109/CVPR52733.2024.00627},
  url = {https://doi.org/10.1109/CVPR52733.2024.00627},
  researchr = {https://researchr.org/publication/SoucekDWLS24},
  cites = {0},
  citedby = {0},
  pages = {6561-6571},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-5300-6},
}