Asd-Transformer: Efficient Active Speaker Detection Using Self And Multimodal Transformers

Gourav Datta, Tyler Etchart, Vivek Yadav, Varsha Hedau, Pradeep Natarajan, Shih-Fu Chang. Asd-Transformer: Efficient Active Speaker Detection Using Self And Multimodal Transformers. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. pages 4568-4572, IEEE, 2022. [doi]

Authors

Gourav Datta

This author has not been identified. Look up 'Gourav Datta' in Google

Tyler Etchart

This author has not been identified. Look up 'Tyler Etchart' in Google

Vivek Yadav

This author has not been identified. Look up 'Vivek Yadav' in Google

Varsha Hedau

This author has not been identified. Look up 'Varsha Hedau' in Google

Pradeep Natarajan

This author has not been identified. Look up 'Pradeep Natarajan' in Google

Shih-Fu Chang

This author has not been identified. Look up 'Shih-Fu Chang' in Google