Natural-Language-Driven Multimodal Representation Learning for Audio-Visual Scene-Aware Dialog System

Yoonseok Heo, Sangwoo Kang, Jungyun Seo. Natural-Language-Driven Multimodal Representation Learning for Audio-Visual Scene-Aware Dialog System. Sensors, 23(18):7875, September 2023. [doi]

@article{HeoKS23,
  title = {Natural-Language-Driven Multimodal Representation Learning for Audio-Visual Scene-Aware Dialog System},
  author = {Yoonseok Heo and Sangwoo Kang and Jungyun Seo},
  year = {2023},
  month = {September},
  doi = {10.3390/s23187875},
  url = {https://doi.org/10.3390/s23187875},
  researchr = {https://researchr.org/publication/HeoKS23},
  cites = {0},
  citedby = {0},
  journal = {Sensors},
  volume = {23},
  number = {18},
  pages = {7875},
}