Shachi H. Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman. Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog. In Visually Grounded Interaction and Language (ViGIL), NeurIPS 2019 Workshop, Vancouver, Canada, December 13, 2019. 2019.
No references recorded for this publication.
No citations of this publication recorded.