Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function

Qing Wang 0008, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee. Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. In Kong-Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong, editors, 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022, pages 250-254. IEEE, 2022.

Authors

Qing Wang 0008

Hang Chen

Ya Jiang

Zhe Wang

Yuyang Wang

Jun Du

Chin-Hui Lee
