Efficient Speaker Naming via Deep Audio-Face Fusion and End-to-End Attention Model

Xin Liu, Jiajia Geng, Haibin Ling. Efficient Speaker Naming via Deep Audio-Face Fusion and End-to-End Attention Model. In 4th IAPR Asian Conference on Pattern Recognition, ACPR 2017, Nanjing, China, November 26-29, 2017. pages 405-410, IEEE Computer Society, 2017. [doi]

@inproceedings{LiuGL17-7,
  title = {Efficient Speaker Naming via Deep Audio-Face Fusion and End-to-End Attention Model},
  author = {Xin Liu and Jiajia Geng and Haibin Ling},
  year = {2017},
  doi = {10.1109/ACPR.2017.13},
  url = {https://doi.org/10.1109/ACPR.2017.13},
  researchr = {https://researchr.org/publication/LiuGL17-7},
  cites = {0},
  citedby = {0},
  pages = {405-410},
  booktitle = {4th IAPR Asian Conference on Pattern Recognition, ACPR 2017, Nanjing, China, November 26-29, 2017},
  publisher = {IEEE Computer Society},
  isbn = {978-1-5386-3354-0},
}