End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features

Chiori Hori, Huda Alamri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh. End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, United Kingdom, May 12-17, 2019, pages 2352-2356. IEEE, 2019. doi: 10.1109/ICASSP.2019.8682583

@inproceedings{HoriA0WHCMCLDEB19,
  title = {End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features},
  author = {Chiori Hori and Huda Alamri and Jue Wang and Gordon Wichern and Takaaki Hori and Anoop Cherian and Tim K. Marks and Vincent Cartillier and Raphael Gontijo Lopes and Abhishek Das and Irfan Essa and Dhruv Batra and Devi Parikh},
  year = {2019},
  doi = {10.1109/ICASSP.2019.8682583},
  url = {https://doi.org/10.1109/ICASSP.2019.8682583},
  researchr = {https://researchr.org/publication/HoriA0WHCMCLDEB19},
  pages = {2352--2356},
  booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, United Kingdom, May 12-17, 2019},
  publisher = {IEEE},
  isbn = {978-1-4799-8131-1},
}