Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion

Baptiste Pouthier, Laurent Pilati, Leela K. Gudupudi, Charles Bouveyron, Frédéric Precioso. Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 2381-2385, ISCA, 2021. [doi]

@inproceedings{PouthierPGBP21,
  title = {Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion},
  author = {Baptiste Pouthier and Laurent Pilati and Leela K. Gudupudi and Charles Bouveyron and Frédéric Precioso},
  year = {2021},
  doi = {10.21437/Interspeech.2021-80},
  url = {https://doi.org/10.21437/Interspeech.2021-80},
  researchr = {https://researchr.org/publication/PouthierPGBP21},
  cites = {0},
  citedby = {0},
  pages = {2381-2385},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}