Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos

Zishun Feng, Ming Tu, Rui Xia, Yuxuan Wang, Ashok Krishnamurthy. Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos. In Xintao Wu, Chris Jermaine, Li Xiong, Xiaohua Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, ChengXiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz, editors, IEEE International Conference on Big Data, Big Data 2020, Atlanta, GA, USA, December 10-13, 2020, pages 5671-5672. IEEE, 2020.

Abstract

Abstract is missing.