MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara L. Berg, Mohit Bansal. MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 2603-2614, Association for Computational Linguistics, 2020. [doi]

@inproceedings{LeiWSYBB20,
  title = {MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning},
  author = {Jie Lei and Liwei Wang and Yelong Shen and Dong Yu and Tamara L. Berg and Mohit Bansal},
  year = {2020},
  url = {https://www.aclweb.org/anthology/2020.acl-main.233/},
  researchr = {https://researchr.org/publication/LeiWSYBB20},
  cites = {0},
  citedby = {0},
  pages = {2603-2614},
  booktitle = {Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020},
  editor = {Dan Jurafsky and Joyce Chai and Natalie Schluter and Joel R. Tetreault},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-952148-25-5},
}