Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

Junwen Xiong, Yu Zhou, Peng Zhang 0005, Lei Xie 0001, Wei Huang 0013, Yufei Zha. Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement. IEEE Transactions on Multimedia, 25:5800-5812, 2023. [doi]

Abstract

Abstract is missing.