Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion

Baptiste Pouthier, Laurent Pilati, Leela K. Gudupudi, Charles Bouveyron, Frédéric Precioso. Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-Based Multimodal Fusion. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 2381-2385, ISCA, 2021. [doi]

Abstract

Abstract is missing.