Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function

Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee. Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. In Kong-Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong, editors, 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022, pages 250-254. IEEE, 2022. doi:10.1109/ISCSLP57327.2022.10037995

@inproceedings{WangCJWWDL22,
  title = {Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function},
  author = {Qing Wang and Hang Chen and Ya Jiang and Zhe Wang and Yuyang Wang and Jun Du and Chin-Hui Lee},
  year = {2022},
  doi = {10.1109/ISCSLP57327.2022.10037995},
  url = {https://doi.org/10.1109/ISCSLP57327.2022.10037995},
  researchr = {https://researchr.org/publication/WangCJWWDL22},
  cites = {0},
  citedby = {0},
  pages = {250--254},
  booktitle = {13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022},
  editor = {Kong-Aik Lee and Hung-yi Lee and Yanfeng Lu and Minghui Dong},
  publisher = {IEEE},
  isbn = {979-8-3503-9796-3},
}