The following publications are possibly variants of this publication:
- Deep Learning for Audio Visual Emotion RecognitionTassadaq Hussain, Wenwu Wang, Nidhal Bouaynaya, Hassan M. Fathallah-Shaykh, Lyudmila Mihaylova. fusion 2022: 1-8 [doi]
- Human action recognition based on multi-mode spatial-temporal feature fusionDongli Wang, Jun Yang, Yan Zhou. fusion 2019: 1-7 [doi]
- Audio-visual tracking of a variable number of speakers with a random finite set approachVolkan Kilic, Xionghu Zhong, Mark Barnard, Wenwu Wang, Josef Kittler. fusion 2014: 1-7 [doi]