Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection

Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li 0001. Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. In Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo Cesar, Florian Metze, Balakrishnan Prabhakaran, editors, MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021. pages 3927-3935, ACM, 2021. [doi]

Abstract

Abstract is missing.