Video Captioning Using Attention Based Visual Fusion with Bi-temporal Context and Bi-modal Semantic Feature Learning

Noorhan K. Fawzy, Mohammed A. Marey, Mostafa M. Aref. Video Captioning Using Attention Based Visual Fusion with Bi-temporal Context and Bi-modal Semantic Feature Learning. In Aboul Ella Hassanien, Adam Slowik, Václav Snásel, Hisham El-Deeb, Fahmy M. Tolba, editors, Proceedings of the International Conference on Advanced Intelligent Systems and Informatics, AISI 2020, Cairo, Egypt, 19-21 October 2020. Volume 1261 of Advances in Intelligent Systems and Computing, pages 65-78, Springer, 2020.
