MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara L. Berg, Mohit Bansal. MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 2603-2614, Association for Computational Linguistics, 2020. [doi]

Authors

Jie Lei

This author has not been identified. Look up 'Jie Lei' in Google

Liwei Wang

This author has not been identified. Look up 'Liwei Wang' in Google

Yelong Shen

This author has not been identified. Look up 'Yelong Shen' in Google

Dong Yu

This author has not been identified. Look up 'Dong Yu' in Google

Tamara L. Berg

This author has not been identified. Look up 'Tamara L. Berg' in Google

Mohit Bansal

This author has not been identified. Look up 'Mohit Bansal' in Google