Variational Stacked Local Attention Networks for Diverse Video Captioning

Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M. Ashraful Amin, A. K. M. Mahbubur Rahman. Variational Stacked Local Attention Networks for Diverse Video Captioning. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022, Waikoloa, HI, USA, January 3-8, 2022. pages 2493-2502, IEEE, 2022. [doi]

Abstract

Abstract is missing.