Yuanmao Luo, Ruomei Wang 0001, Fuwei Zhang, Fan Zhou 0001, Mingyang Liu, Jiawei Feng. Video Q &A based on two-stage deep exploration of temporally-evolving features with enhanced cross-modal attention mechanism. Neural Computing and Applications, 36(14):8055-8071, May 2024. [doi]
Abstract is missing.