Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing

Ross Cutler, Ramin Mehran, Sam Johnson, Cha Zhang, Adam Kirk, Oliver Whyte, Adarsh Kowdle. Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020. pages 4527-4531, IEEE, 2020. [doi]

@inproceedings{CutlerMJZKWK20,
  title = {Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing},
  author = {Ross Cutler and Ramin Mehran and Sam Johnson and Cha Zhang and Adam Kirk and Oliver Whyte and Adarsh Kowdle},
  year = {2020},
  doi = {10.1109/ICASSP40776.2020.9053171},
  url = {https://doi.org/10.1109/ICASSP40776.2020.9053171},
  researchr = {https://researchr.org/publication/CutlerMJZKWK20},
  cites = {0},
  citedby = {0},
  pages = {4527-4531},
  booktitle = {2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020},
  publisher = {IEEE},
  isbn = {978-1-5090-6631-5},
}