Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing

Ross Cutler, Ramin Mehran, Sam Johnson, Cha Zhang, Adam Kirk, Oliver Whyte, Adarsh Kowdle. Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020, pages 4527-4531. IEEE, 2020.

Abstract

Abstract is missing.