Efficient Audio-Visual Speaker Recognition Via Deep Multi-Modal Feature Fusion

Yufei Wang. Efficient Audio-Visual Speaker Recognition Via Deep Multi-Modal Feature Fusion. In 17th International Conference on Computational Intelligence and Security CIS 2021, Chengdu, China, November 19-22, 2021. pages 99-103, IEEE, 2021. [doi]

@inproceedings{Wang21-209,
  title = {Efficient Audio-Visual Speaker Recognition Via Deep Multi-Modal Feature Fusion},
  author = {Yufei Wang},
  year = {2021},
  doi = {10.1109/CIS54983.2021.00029},
  url = {https://doi.org/10.1109/CIS54983.2021.00029},
  researchr = {https://researchr.org/publication/Wang21-209},
  cites = {0},
  citedby = {0},
  pages = {99-103},
  booktitle = {17th International Conference on Computational Intelligence and Security CIS 2021, Chengdu, China, November 19-22, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-9489-2},
}