Shumpei Saito, Hiroyuki Ueda, Yosuke Ito, Kazuyoshi Yoshii. Narrativity-Aware Video Summarization Based on Vision and Language Foundation Models. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. pages 1991-1996, IEEE, 2025. [doi]
Abstract is missing.