MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers

Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar. MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers. In 11th European Workshop on Visual Information Processing, EUVIP 2023, Gjovik, Norway, September 11-14, 2023. pages 1-6, IEEE, 2023. [doi]

Abstract

Abstract is missing.