Generating Natural Video Descriptions via Multimodal Processing

Qin Jin, Junwei Liang, Xiaozhu Lin. Generating Natural Video Descriptions via Multimodal Processing. In Nelson Morgan, editor, Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pages 570-574. ISCA, 2016.
