A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset

Javad Peymanfard, Samin Heydarian, Ali Lashini, Hossein Zeinali, Mohammad Reza Mohammadi, Nasser Mozayani. A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset. Expert Syst. Appl., 238(Part E):121648, March 2024. [doi]

@article{PeymanfardHLZMM24,
  title = {A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset},
  author = {Javad Peymanfard and Samin Heydarian and Ali Lashini and Hossein Zeinali and Mohammad Reza Mohammadi and Nasser Mozayani},
  year = {2024},
  month = {March},
  doi = {10.1016/j.eswa.2023.121648},
  url = {https://doi.org/10.1016/j.eswa.2023.121648},
  researchr = {https://researchr.org/publication/PeymanfardHLZMM24},
  cites = {0},
  citedby = {0},
  journal = {Expert Syst. Appl.},
  volume = {238},
  number = {Part E},
  pages = {121648},
}