Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos

Guyue Hu, Bin He, Hanwang Zhang. Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos. Int. J. Autom. Comput., 20(2):249-262, April 2023.
