Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT - researchr publication

researchr

You are not signed in
Sign in
Sign up

Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu. Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 4785-4789, ISCA, 2022. [doi]

Abstract is missing.

runs on WebDSL