A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset

Javad Peymanfard, Samin Heydarian, Ali Lashini, Hossein Zeinali, Mohammad Reza Mohammadi, Nasser Mozayani. A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset. Expert Syst. Appl., 238(Part E):121648, March 2024.
