Variational Stacked Local Attention Networks for Diverse Video Captioning

Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M. Ashraful Amin, A. K. M. Mahbubur Rahman. Variational Stacked Local Attention Networks for Diverse Video Captioning. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022, Waikoloa, HI, USA, January 3-8, 2022. pages 2493-2502, IEEE, 2022. [doi]

@inproceedings{DebSBAAR22,
  title = {Variational Stacked Local Attention Networks for Diverse Video Captioning},
  author = {Tonmoay Deb and Akib Sadmanee and Kishor Kumar Bhaumik and Amin Ahsan Ali and M. Ashraful Amin and A. K. M. Mahbubur Rahman},
  year = {2022},
  doi = {10.1109/WACV51458.2022.00255},
  url = {https://doi.org/10.1109/WACV51458.2022.00255},
  researchr = {https://researchr.org/publication/DebSBAAR22},
  cites = {0},
  citedby = {0},
  pages = {2493-2502},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022, Waikoloa, HI, USA, January 3-8, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-0915-5},
}