Rule-Embedded Network for Audio-Visual Voice Activity Detection in Live Musical Video Streams

Yuanbo Hou, Yi Deng, Bilei Zhu, Zejun Ma, Dick Botteldooren. Rule-Embedded Network for Audio-Visual Voice Activity Detection in Live Musical Video Streams. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. pages 4165-4169, IEEE, 2021. [doi]

Abstract

Abstract is missing.