AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking

Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez. AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking. In Samy Bengio, Hervé Bourlard, editors, Machine Learning for Multimodal Interaction, First International Workshop,MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers. Volume 3361 of Lecture Notes in Computer Science, pages 182-195, Springer, 2004. [doi]

Abstract

Abstract is missing.