Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

Shachi H. Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman. Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog. In Visually Grounded Interaction and Language (ViGIL), NeurIPS 2019 Workshop, Vancouver, Canada, December 13, 2019.

@inproceedings{KumarOSHN19,
  title = {Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog},
  author = {Shachi H. Kumar and Eda Okur and Saurav Sahay and Jonathan Huang and Lama Nachman},
  year = {2019},
  url = {https://vigilworkshop.github.io/static/papers/39.pdf},
  researchr = {https://researchr.org/publication/KumarOSHN19},
  cites = {0},
  citedby = {0},
  booktitle = {Visually Grounded Interaction and Language (ViGIL), NeurIPS 2019 Workshop, Vancouver, Canada, December 13, 2019},
}