SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition

Xing Zhang, Zuxuan Wu, Yu-Gang Jiang. SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. IEEE Transactions on Multimedia, 24:313-322, 2022. [doi]

Abstract

Abstract is missing.