Two-Stream Transformer Architecture for Long Form Video Understanding

Edward Fish, Jon Weinbren, Andrew Gilbert. Two-Stream Transformer Architecture for Long Form Video Understanding. In 33rd British Machine Vision Conference 2022, BMVC 2022, London, UK, November 21-24, 2022. pages 660, BMVA Press, 2022. [doi]

Abstract

Abstract is missing.