Generating Natural Video Descriptions via Multimodal Processing

Qin Jin, Junwei Liang, Xiaozhu Lin. Generating Natural Video Descriptions via Multimodal Processing. In Nelson Morgan, editor, Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016. pages 570-574, ISCA, 2016. [doi]

Authors

Qin Jin

This author has not been identified. Look up 'Qin Jin' in Google

Junwei Liang

This author has not been identified. Look up 'Junwei Liang' in Google

Xiaozhu Lin

This author has not been identified. Look up 'Xiaozhu Lin' in Google