Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

Shachi H. Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman. Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog. In Visually Grounded Interaction and Language (ViGIL), NeurIPS 2019 Workshop, Vancouver, Canada, December 13, 2019. 2019. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: