Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M. Ashraful Amin, A. K. M. Mahbubur Rahman. Variational Stacked Local Attention Networks for Diverse Video Captioning. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022, Waikoloa, HI, USA, January 3-8, 2022. pages 2493-2502, IEEE, 2022. [doi]
Abstract is missing.