C4AV: Learning Cross-Modal Representations from Transformers

Shujie Luo, Hang Dai, Ling Shao 0001, Yong Ding 0003. C4AV: Learning Cross-Modal Representations from Transformers. In Adrien Bartoli, Andrea Fusiello, editors, Computer Vision - ECCV 2020 Workshops - Glasgow, UK, August 23-28, 2020, Proceedings, Part II. Volume 12536 of Lecture Notes in Computer Science, pages 33-38, Springer, 2020. [doi]

Abstract

Abstract is missing.