Hierarchical multi-modal fusion with vision transformers for robust action recognition in infrared-visible videos

Javed Imran, Mohammed Wasid. Hierarchical multi-modal fusion with vision transformers for robust action recognition in infrared-visible videos. IJMIR, 14(4):36, December 2025.
