Video-text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network

Gang Lv, Yining Sun, Fudong Nian. Video-text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network. Multimedia Syst., 30(1):35, February 2024.
