Attention-Based Audio-Visual Fusion for Video Summarization

Yinghong Fang, Junpeng Zhang, Cewu Lu. Attention-Based Audio-Visual Fusion for Video Summarization. In Tom Gedeon, Kok Wai Wong, Minho Lee, editors, Neural Information Processing - 26th International Conference, ICONIP 2019, Sydney, NSW, Australia, December 12-15, 2019, Proceedings, Part II. Volume 11954 of Lecture Notes in Computer Science, pages 328-340, Springer, 2019. [doi]

Abstract

Abstract is missing.