MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation

Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian 0006, Ran He, Yu Qiao 0001, Chen Change Loy. MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXI. Volume 12366 of Lecture Notes in Computer Science, pages 700-717, Springer, 2020. [doi]

@inproceedings{WangWSYWQHQL20,
  title = {MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation},
  author = {Kaisiyuan Wang and Qianyi Wu and Linsen Song and Zhuoqian Yang and Wayne Wu and Chen Qian 0006 and Ran He and Yu Qiao 0001 and Chen Change Loy},
  year = {2020},
  doi = {10.1007/978-3-030-58589-1_42},
  url = {https://doi.org/10.1007/978-3-030-58589-1_42},
  researchr = {https://researchr.org/publication/WangWSYWQHQL20},
  cites = {0},
  citedby = {0},
  pages = {700-717},
  booktitle = {Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXI},
  editor = {Andrea Vedaldi and Horst Bischof and Thomas Brox and Jan-Michael Frahm},
  volume = {12366},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-030-58589-1},
}