Video Q &A based on two-stage deep exploration of temporally-evolving features with enhanced cross-modal attention mechanism

Yuanmao Luo, Ruomei Wang 0001, Fuwei Zhang, Fan Zhou 0001, Mingyang Liu, Jiawei Feng. Video Q &A based on two-stage deep exploration of temporally-evolving features with enhanced cross-modal attention mechanism. Neural Computing and Applications, 36(14):8055-8071, May 2024. [doi]

Abstract

Abstract is missing.