A Hierarchical Deep Video Understanding Method with Shot-Based Instance Search and Large Language Model

Ruizhe Li, Jiahao Guo, Mingxi Li, Zhengqian Wu, Chao Liang. A Hierarchical Deep Video Understanding Method with Shot-Based Instance Search and Large Language Model. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023 - 3 November 2023, pages 9425-9429. ACM, 2023.
