A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions

Jack Hessel, Bo Pang, Zhenhai Zhu, Radu Soricut. A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions. In Mohit Bansal, Aline Villavicencio, editors, Proceedings of the 23rd Conference on Computational Natural Language Learning, CoNLL 2019, Hong Kong, China, November 3-4, 2019. pages 419-429, Association for Computational Linguistics, 2019. [doi]

Abstract

Abstract is missing.